llama.cpp
3e4b675c
- ggml-quants : use a max-heap for TQ1_0 and TQ2_0 quantization
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
264 days ago
ggml-quants : use a max-heap for TQ1_0 and TQ2_0 quantization
References
#12557 - ggml-quants : weighted rounding algorithms with cumulative search
Author
compilade
Committer
compilade
Parents
f86b8ff2
Loading