llama.cpp
30ad9c28 - ggml-quants : faster exhaustive IQ4_NL rounding with k_heap

Commit

352 days ago

ggml-quants : faster exhaustive IQ4_NL rounding with k_heap

References

#12557 - ggml-quants : weighted rounding algorithms with cumulative search

Author

compilade

compilade

Committer

compilade

compilade

Parents

Loading