fix gguf bug of moe models and lmhead/embedding bits setting regression #628
fix
17357868
merge main
0107f4ac
debug
60131695
wenhuach21
marked this pull request as draft 181 days ago
fix
e92586e0
merge main
ecd63283
reverse
eef16219
Merge branch 'main' into hengguo/fix_issue_604
f929e30a
n1ck-guo
marked this pull request as ready for review 180 days ago
clean
772774a0
Merge branch 'hengguo/fix_issue_604' of https://github.com/intel/auto…
c93fded1
fix
7ece2c99
wenhuach21
changed the title from "fix bug of Qwen3 moe" to "fix gguf bug of Qwen3 moe" 180 days ago
fix by comment
e2964b03
fix by comment
615d55e4
fix
faef1c08
wenhuach21
changed the title from "fix gguf bug of Qwen3 moe" to "fix gguf bug of moe models" 180 days ago
change api
1d72fbd1
add ut
098573a5
fix embedding and lm-head bits regression
f08cf973
Merge branch 'hengguo/fix_issue_604' of https://github.com/intel/auto…
f27111b1
wenhuach21
changed the title from "fix gguf bug of moe models" to "fix gguf bug of moe models and lmhead/embedding bits setting regression" 180 days ago
fix ut
3679ff0d
update
d880ce8c
fix
1bf7eeee
fix lm_head / embedding
4b2df695
code scan
b248f058
support blockwise imatrix
694afd15
[pre-commit.ci] auto fixes from pre-commit.com hooks
baf97b78
fix
9ec413a2
n1ck-guo
merged commit 20eed5b6 into main 179 days ago
n1ck-guo
deleted the hengguo/fix_issue_604 branch 179 days ago