llama.cpp
02d69881 - Improve cuBLAS performance by dequantizing on the GPU (#1065)

Commit
2 years ago
Improve cuBLAS performance by dequantizing on the GPU (#1065)
Author
Parents
Loading