llama.cpp
4a3156de - CUDA: faster dequantize kernels for Q4_0 and Q4_1 (#4938)

Commit
1 year ago
CUDA: faster dequantize kernels for Q4_0 and Q4_1 (#4938) Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Author
Parents
Loading