llama.cpp
5b263dd8 - cuda : unroll Q*K^T loop

Commit
2 years ago
cuda : unroll Q*K^T loop
Author
Parents
Loading