llama.cpp 58062860 - ggml : use WARP_SIZE/2 for argmax reduction offset (#18092)
Commit
44 days ago
ggml : use WARP_SIZE/2 for argmax reduction offset (#18092)
References
#18092 - ggml : use WARP_SIZE/2 for argmax reduction offset
Author
Aadeshveer
Parents
2973a65e