llama.cpp
55717c98 - metal : warp-based reduction for soft max kernel

1 year ago