Integrate RTN quantization into GGUF packing to enhance robustness #644
support for gguf packing immediatly
a7c8b177
Merge branch 'main' into hengguo/gguf_update_0704
f33e6113
fix
3d6587d4
if only export gguf, using gguf-packing instead of rtn
37059d4b
wenhuach21
changed the title from "support for gguf packing immediatly" to "move rtn quantization to gguf-packing to improve robustness" 248 days ago
wenhuach21
changed the title from "move rtn quantization to gguf-packing to improve robustness" to "Integrate RTN quantization into GGUF packing to enhance robustness" 248 days ago
fix make_q3_quants
1ed8f5ca
support for llama4
b37d270b
sym convert.py
ab62157a
[pre-commit.ci] auto fixes from pre-commit.com hooks
36dc69fb
code scan
bf1c93b7
fix
17c2c3b3
wenhuach21
deleted the hengguo/gguf_update_0704 branch 246 days ago
Assignees
No one assigned