GGUF support step2: add naive Q2_KS and Q4_KS #448
init
31017be1
update
7b435cd4
Merge branch 'main' into hengguo/gguf_q4_k
26420e0d
wenhuach21
marked this pull request as draft 1 year ago
update
e938fec0
[pre-commit.ci] auto fixes from pre-commit.com hooks
a30a24f3
merge main
670612d7
update
30d77078
Merge branch 'hengguo/gguf_q4_k' of https://github.com/intel/auto-rou…
5d5d30fa
update
f2790e86
wenhuach21
marked this pull request as ready for review 1 year ago
q2_k
5418bb1a
update
0656c578
wenhuach21
changed the title from "suport for gguf Q*_K" to "Step2 support naive Q2_KS and Q4_KS" 1 year ago
fix
9897fd9e
spell
d421a57e
fix config
a954c0c6
act quant determine in wrapper
1b12b0a3
rollback
37f12ebc
wenhuach21
changed the title from "Step2 support naive Q2_KS and Q4_KS" to "GGUF support step2: add naive Q2_KS and Q4_KS" 1 year ago
fix ut fail
8d705d84
reset
ef38aa0a
Merge branch 'main' into hengguo/gguf_q4_k
0ad36840
update
894fb86e
wenhuach21
deleted the hengguo/gguf_q4_k branch 1 year ago