llama.cpp
334a835a
- ggml : importance matrix support for legacy quants (#4969)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
ggml : importance matrix support for legacy quants (#4969) * imatrix: adding support for legacy quants * imatrix: guard Q4_0/Q5_0 against ffn_down craziness --------- Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
References
#4969 - Importance matrix support for legacy quants
Author
ikawrakow
Parents
4feb4b33
Loading