llama.cpp
arm64: optimize q6_k_q8_k kernel with i8mm
#13519
Merged
