llama.cpp
3e4b675c - ggml-quants : use a max-heap for TQ1_0 and TQ2_0 quantization

Commit

264 days ago

ggml-quants : use a max-heap for TQ1_0 and TQ2_0 quantization

References

#12557 - ggml-quants : weighted rounding algorithms with cumulative search

Author

compilade

compilade

Committer

compilade

compilade

Parents

Loading