Support for more gguf format and float zp for Q*_1 #560
support for float zp and q5_0/1
6fd5a4a7
Merge branch 'main' into hengguo/gguf_extend
9027e941
[pre-commit.ci] auto fixes from pre-commit.com hooks
67098964
support for q5_k_s
489b5579
support for q3_k
48669db6
fix
dd6ccdfe
fix & support for q6_k
23961a64
float zp for q*_1, auto switch
b1a6b04c
shorter log for gguf support format
9bab43d1
fix
fd28e99b
add ut
9d87890c
Merge branch 'main' into hengguo/gguf_extend
427f0500
Merge branch 'main' into hengguo/gguf_extend
7784abf1
wenhuach21
changed the title [WIP] Support for more gguf format and float zp for Q*_1 Support for more gguf format and float zp for Q*_1 308 days ago
add ut
127b33e3
move gguf_check to quantize_and_save, add torchvision to ut requirements
0eb3f1d4
fix
4438a551
fix
8ba65f71
fix bug for transformers new version
236b7dd4
try to fix triton runtimeerror
7676cf25
revert
9ef8e639
fix
dc863e47
fix pip install url
afe24c64
fix
59fcb45e
wenhuach21
deleted the hengguo/gguf_extend branch 306 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub