llama.cpp
5143fa89
- CUDA: fastdiv, launch bounds for mmvq + q8_1 quant (#15802)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
35 days ago
CUDA: fastdiv, launch bounds for mmvq + q8_1 quant (#15802) * CUDA: fastdiv, launch bounds for mmvq + q8_1 quant
References
#15802 - CUDA: fastdiv, launch bounds for mmvq + q8_1 quant
Author
JohannesGaessler
Parents
3a550b5c
Loading