llama.cpp
CUDA: fewer memory bank conflicts for mul_mat_q
#2458
Merged
