llama.cpp
3be11510 - ggml-quants : use a max-heap for linear quants like Q3_K

Commit

272 days ago

ggml-quants : use a max-heap for linear quants like Q3_K Slightly faster than the previous method.

References

Author

compilade

Parents