llama.cpp
8b8b88f3
- ggml-quants : restore Q2_K use of make_qp_quants
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
345 days ago
ggml-quants : restore Q2_K use of make_qp_quants Weirdly, it seems like in practice replacing this instance is not better. This is probably because of its interaction with make_qkx3_quants.
References
#12557 - ggml-quants : weighted rounding algorithms with cumulative search
Author
compilade
Parents
a4113972
Loading