llama.cpp
9fd1e83f - Use Q4_K for attn_v for Q2_K_S when n_gqa >= 4

Commit
2 years ago
Use Q4_K for attn_v for Q2_K_S when n_gqa >= 4
Author
Iwan Kawrakow
Committer
Iwan Kawrakow
Parents
Loading