llama.cpp
f4cb4eac - iq3_s_mult: play with blocks of 16

Commit
1 year ago
iq3_s_mult: play with blocks of 16

This brings the bpw to 3.5625. We come close but don't quite match lookup with 3.4375 bpw (blocks of 32).
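
The two bpw figures in the message differ by exactly 0.125, which is what one would expect if halving the block size only doubles the number of per-block scales. The sketch below checks that arithmetic. It is not taken from the commit: it assumes a 256-weight super-block, a 4-bit scale per block, and a fixed payload of 106 bytes (fp16 super-block scale, quants, high bits and signs, as in the stock IQ3_S layout). The name `bits_per_weight` and its parameters are hypothetical.

```python
from fractions import Fraction

# Assumption: weights per super-block (QK_K in llama.cpp's k-/i-quants).
SUPER_BLOCK = 256

def bits_per_weight(block_size, fixed_bytes=106, scale_bits=4):
    """Bits per weight for a super-block with one small scale per block.

    fixed_bytes is the assumed payload that does not depend on block size:
    2 (fp16 scale) + 64 (quants) + 8 (high bits) + 32 (signs) = 106 bytes.
    scale_bits is the assumed width of each per-block scale.
    """
    n_blocks = SUPER_BLOCK // block_size
    total_bits = fixed_bytes * 8 + n_blocks * scale_bits
    return Fraction(total_bits, SUPER_BLOCK)

bpw_32 = bits_per_weight(32)  # 8 blocks  -> 8 * 4  = 32 scale bits
bpw_16 = bits_per_weight(16)  # 16 blocks -> 16 * 4 = 64 scale bits

print(float(bpw_32), float(bpw_16), float(bpw_16 - bpw_32))
```

Under these assumptions the extra 32 scale bits per 256 weights account for the full gap between the two figures.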
Author
Iwan Kawrakow