llama.cpp
CUDA: faster k-quant mul_mat_q kernels
#2525
Merged
