llama.cpp
ebd062bc - cuda : use 512 threads for soft_max instead of 32

Commit
1 year ago
cuda : use 512 threads for soft_max instead of 32
Author
Committer
Parents
Loading