llvm-project
696805d1
- [AArch64] Improve non-SVE popcount for 32bit and 64 bit using udot (#95881)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
[AArch64] Improve non-SVE popcount for 32bit and 64 bit using udot (#95881) fixes #95860 Use `udot` instead of a sequence of `uaddlp` instructions when summing up lanes for `popcount`.
References
#95881 - [AArch64] Improve non-SVE popcount for 32bit and 64 bit using udot
Author
tgymnich
Parents
b48623c2
Loading