llama.cpp
02d69881
- Improve cuBLAS performance by dequantizing on the GPU (#1065)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 years ago
Improve cuBLAS performance by dequantizing on the GPU (#1065)
References
#1065 - Improve cuBLAS performance by dequantizing on the GPU
Author
slaren
Parents
834695fe
Loading