llama.cpp
f4cb4eac
- iq3_s_mult: play with blocks of 16
Commit
1 year ago
iq3_s_mult: play with blocks of 16

This brings the bpw to 3.5625. We come close but don't quite match lookup with 3.4375 bpw (blocks of 32).
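The two bpw figures differ by 0.125, which is what one extra 4-bit scale per 32 weights would cost. The sketch below reproduces both numbers under an assumed layout that is not taken from this commit: a 256-weight super-block holding one fp16 scale, one 8-bit grid index plus one high bit per group of 4 weights, one sign bit per weight, and one 4-bit scale per block. The names `SUPER_BLOCK`, `FIXED_BITS`, and `bits_per_weight` are hypothetical and exist only for illustration.

```python
from fractions import Fraction

# Assumed super-block size (weights); not stated in the commit.
SUPER_BLOCK = 256

# Assumed per-super-block cost that does not depend on the block size, in bits:
#   16  : one fp16 super-block scale
#   512 : one 8-bit grid index per group of 4 weights (64 groups)
#   64  : one extra high index bit per group of 4 weights
#   256 : one sign bit per weight
FIXED_BITS = 16 + 512 + 64 + 256


def bits_per_weight(block_size, scale_bits=4):
    """Bits per weight when every block of `block_size` weights has its own scale."""
    n_blocks = SUPER_BLOCK // block_size
    total_bits = FIXED_BITS + n_blocks * scale_bits
    return Fraction(total_bits, SUPER_BLOCK)


print(float(bits_per_weight(32)))  # 3.4375
print(float(bits_per_weight(16)))  # 3.5625
```

Under these assumptions, halving the block size from 32 to 16 doubles the number of block scales from 8 to 16 per super-block, adding 32 bits per 256 weights, i.e. 0.125 bpw.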
References
#5867 - IQ3_S: multiplier based code book
Author
Iwan Kawrakow
Parents
dbe98dfe