llama.cpp
4a3156de - CUDA: faster dequantize kernels for Q4_0 and Q4_1 (#4938)
Commit
1 year ago
CUDA: faster dequantize kernels for Q4_0 and Q4_1 (#4938)
Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
References
#4938 - CUDA: faster dequantize kernels for Q4_0 and Q4_1
Author
ikawrakow
Parents
a836c8f5