llama.cpp
CUDA: Quantized matrix matrix multiplication
#2160
Merged

Loading