llama.cpp
3be11510 - ggml-quants : use a max-heap for linear quants like Q3_K

Commit
272 days ago
ggml-quants : use a max-heap for linear quants like Q3_K Slightly faster than the previous method.
Author
Parents
Loading