libFLAC : SSE optimisations.
commit02591f6b4b09e7c6e26bd2ae7225ff2810b227bb
authorErik de Castro Lopo <erikd@mega-nerd.com>
Mon, 28 Jul 2014 10:43:54 +0000 (28 20:43 +1000)
committerErik de Castro Lopo <erikd@mega-nerd.com>
Mon, 28 Jul 2014 10:46:16 +0000 (28 20:46 +1000)
treebe4b7a289503e65483de44e8733375f00b2d3a4f
parent8ce310dda2421ce9932969c4f038ac4f1d84c39f
libFLAC : SSE optimisations.

Add new function:

    FLAC__lpc_compute_residual_from_qlp_coefficients_intrin_sse41()

and rewrite function:

    FLAC__lpc_compute_residual_from_qlp_coefficients_16_intrin_sse2()

Testing shows noticeable speed increase on Intel Core i3/5/7 (up to 30%
for -8 mode), AMD Athlon64, Phenom, Bulldozer/Piledriver, but no increase
or even very small speed decrease (~2% for -8 mode) on Intel Core2.

Patch-from: lvqcl <lvqcl.mail@gmail.com>
src/libFLAC/include/private/lpc.h
src/libFLAC/lpc_intrin_sse2.c
src/libFLAC/lpc_intrin_sse41.c
src/libFLAC/stream_encoder.c