llama.cpp
58062860 - ggml : use WARP_SIZE/2 for argmax reduction offset (#18092)

Commit
44 days ago
ggml : use WARP_SIZE/2 for argmax reduction offset (#18092)
Author
Parents
Loading