auto-round
Integrate RTN quantization into GGUF packing to enhance robustness
#644
Merged

Integrate RTN quantization into GGUF packing to enhance robustness #644

wenhuach21 merged 10 commits into main from hengguo/gguf_update_0704
n1ck-guo
n1ck-guo support for gguf packing immediatly
a7c8b177
n1ck-guo Merge branch 'main' into hengguo/gguf_update_0704
f33e6113
n1ck-guo fix
3d6587d4
n1ck-guo if only export gguf, using gguf-packing instead of rtn
37059d4b
wenhuach21 wenhuach21 changed the title support for gguf packing immediatly move rtn quantization to gguf-packing to improve robustness 248 days ago
wenhuach21 wenhuach21 changed the title move rtn quantization to gguf-packing to improve robustness Integrate RTN quantization into GGUF packing to enhance robustness 248 days ago
wenhuach21
n1ck-guo fix make_q3_quants
1ed8f5ca
n1ck-guo support for llama4
b37d270b
wenhuach21
wenhuach21 commented on 2025-07-07
wenhuach21
wenhuach21 commented on 2025-07-07
n1ck-guo sym convert.py
ab62157a
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
36dc69fb
n1ck-guo code scan
bf1c93b7
n1ck-guo fix
17c2c3b3
n1ck-guo n1ck-guo requested a review from wenhuach21 wenhuach21 247 days ago
wenhuach21
wenhuach21 approved these changes on 2025-07-09
wenhuach21 wenhuach21 merged 06e0f520 into main 246 days ago
wenhuach21 wenhuach21 deleted the hengguo/gguf_update_0704 branch 246 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone