llama.cpp
CUDA: faster non k-quant mul_mat_q kernels
#2483
Merged
