llama.cpp
b958151e - cuda : use half2 in softmax

Commit
2 years ago