llama.cpp
CUDA: refactor mmq, dmmv, mmvq
#7716
Merged

Loading