llama.cpp
CUDA: more warps for mmvq on NVIDIA
#5394
Merged