llama.cpp
30ad9c28
- ggml-quants : faster exhaustive IQ4_NL rounding with k_heap
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
352 days ago
ggml-quants : faster exhaustive IQ4_NL rounding with k_heap
References
#12557 - ggml-quants : weighted rounding algorithms with cumulative search
Author
compilade
Committer
compilade
Parents
0c9e4424
Loading