[1/5] AArch64: Improve A64FX memset for small sizes
commit 07b427296b8d59f439144029d9a948f6c1ce0a31
author Wilco Dijkstra <wdijkstr@arm.com>
Tue, 10 Aug 2021 12:30:27 +0000 (10 13:30 +0100)
committer Wilco Dijkstra <wdijkstr@arm.com>
Tue, 10 Aug 2021 12:30:27 +0000 (10 13:30 +0100)
tree a07eb48b7dbd93af15343faad65d41159ff835e1
parent 1d7b32ee6145c46c4f4f8a208a6b72e0668d7cf3

Improve performance of small memsets by reducing instruction counts and
improving code alignment. Bench-memset shows 35-45% performance gain for
small sizes.

Reviewed-by: Naohiro Tamura <naohirot@fujitsu.com>
sysdeps/aarch64/multiarch/memset_a64fx.S