llama.cpp
CUDA mul mat vec q kernels for k-quants
#2203
Merged

Loading