llama.cpp
CUDA: fix mul_mat_q not used for output tensor
#3127
Merged
