llama.cpp
cbc83436 - Make IQ1_M work for QK_K = 64 (#6327)

Commit
1 year ago
Make IQ1_M work for QK_K = 64 (#6327) * iq1_m: make it work for QK_K = 64 (WIP) * iq1_m: make it work for QK_K = 64 (scalar and AVX2) * iq1_m: QK_K = 64 seems to work on Metal and ARM_NEON --------- Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Author
Parents
Loading