llama.cpp
CUDA: mul_mat_q=true as default for llama_context_params
#2912
Merged

Loading