Improve memccpy performance by using memchr/memcpy/mempcpy rather than