llama.cpp
CUDA: use tensor cores for MMQ
#7676
Merged

Loading