llama.cpp
CUDA: mul_mat_vec_q for batch sizes > 1
#5351
Merged

Loading