llama.cpp
468ea24f
- CUDA: faster non k-quant mul_mat_q kernels (#2483)
Commit
2 years ago
CUDA: faster non k-quant mul_mat_q kernels (#2483)
References
#2483 - CUDA: faster non k-quant mul_mat_q kernels
Author
JohannesGaessler
Parents
4f6b60c7