libFLAC/lpc_intrin_sse.c : New SSE code to calculate autocorrelation.
commite9d805dd4374cead02a4e991515bae3bf9547e43
authorErik de Castro Lopo <erikd@mega-nerd.com>
Sat, 9 Aug 2014 02:32:57 +0000 (9 12:32 +1000)
committerErik de Castro Lopo <erikd@mega-nerd.com>
Sat, 9 Aug 2014 02:49:27 +0000 (9 12:49 +1000)
treeecdbdea901734e4c6c663796414db3518e4a6483
parent0ae3e4689dbd69d9452c0e3e66b1cd54e10b7101
libFLAC/lpc_intrin_sse.c : New SSE code to calculate autocorrelation.

Accelerate FLAC__lpc_compute_autocorrelation_intrin_sse_lag_NN routines for
AMD and newer Intel CPUs (means Core i aka Nehalem and newer). Unfortunately
it's slower on older Intel CPUs.

According to tests at HA:

    <http://www.hydrogenaud.io/forums/index.php?s=&showtopic=101082&view=findpost&p=870753>

  CPU                 flac -5           flac -8

  Athlon XP           +5 %              +2.4 %
  Athlon 64 X2        +9 %              +4 %
  Core i              +7 %              +1 % ... +2.7 %
  Core 2              ?                 -3.5 %

According to Steam HW survey <http://store.steampowered.com/hwsurvey/>
69% of Steam users have SSE4.2 which means that the new code is faster for
them. There are also AMD users that don't have SSE4.2, so 75% of Steam users
should benefit from this patch.

Patch-from: lvqcl <lvqcl.mail@gmail.com>
src/libFLAC/lpc_intrin_sse.c