cutils: add avx2 instruction optimization
commit28b90d9c19d368645f475e36297ca21c53c38799
authorLiang Li <liang.z.li@intel.com>
Tue, 8 Mar 2016 05:53:17 +0000 (8 13:53 +0800)
committerAmit Shah <amit.shah@redhat.com>
Tue, 8 Mar 2016 11:23:26 +0000 (8 16:53 +0530)
tree20493c1d2f176a292481e5ba900ca1c986b77299
parent99f2dbd34328771b12dee782ebccda1c25b4c23b
cutils: add avx2 instruction optimization

buffer_find_nonzero_offset() is a hot function during live migration.
Now it use SSE2 instructions for optimization. For platform supports
AVX2 instructions, use AVX2 instructions for optimization can help
to improve the performance of buffer_find_nonzero_offset() about 30%
comparing to SSE2.

Live migration can be faster with this optimization, the test result
shows that for an 8GiB RAM idle guest just boots, this patch can help
to shorten the total live migration time about 6%.

This patch use the ifunc mechanism to select the proper function when
running, for platform supports AVX2, execute the AVX2 instructions,
else, execute the original instructions.

Signed-off-by: Liang Li <liang.z.li@intel.com>
Suggested-by: Paolo Bonzini <pbonzini@redhat.com>
Suggested-by: Richard Henderson <rth@twiddle.net>
Reviewed-by: Paolo Bonzini <pbonzini@redhat.com>
Message-Id: <1457416397-26671-3-git-send-email-liang.z.li@intel.com>
Signed-off-by: Amit Shah <amit.shah@redhat.com>
include/qemu-common.h
util/cutils.c