auto-round
support model_free WOQ quantization
#1699
Merged

support model_free WOQ quantization #1699

xin3he merged 51 commits into main from xinhe/4-14
xin3he
xin3he implement model free
dc592e99
xin3he polished implementation
177bf48b
xin3he remove useless gpu_concurrency
97e03620
xin3he 添加预编译模式匹配器以提高量化过程中的性能和可扩展性
ff47a97a
xin3he fix typo
4d9ad0e5
xin3he update document
58709e64
xin3he remove useless code and update UT
d3951f26
xin3he mend
16991ea2
xin3he remove high_gpu_mem_usage since no performacen benefit.
83b9b4fe
xin3he update regex
687260db
xin3he fix bug and simplify UT
68d0cb7b
xin3he fix bug
312f75df
xin3he add WOQ limiation and support bits group_size setting
3ca4d3b5
xin3he xin3he requested a review from copilot-pull-request-reviewer copilot-pull-request-reviewer 64 days ago
xin3he Merge branch 'main' into xinhe/4-14
3f15e02d
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
47b3f35d
xin3he update doc
76f99151
xin3he minor fix
c588ad22
copilot-pull-request-reviewer
copilot-pull-request-reviewer commented on 2026-04-17
xin3he enable quant_nontext_module
0c141653
xin3he xin3he requested a review from changwangss changwangss 61 days ago
xin3he xin3he requested a review from yiliu30 yiliu30 61 days ago
xin3he xin3he requested a review from wenhuach21 wenhuach21 61 days ago
xin3he xin3he requested a review from WeiweiZhang1 WeiweiZhang1 61 days ago
xin3he xin3he requested a review from n1ck-guo n1ck-guo 61 days ago
n1ck-guo
n1ck-guo commented on 2026-04-21
yiliu30
yiliu30 commented on 2026-04-21
n1ck-guo
wenhuach21
xin3he
xin3he
wenhuach21
xin3he Enhance model-free quantization support and improve documentation
405de53d
xin3he Merge remote-tracking branch 'origin/main' into xinhe/4-14
6c5ce29a
xin3he support loading pytorch_model.bin and ignore conv1d embed by creating…
0697324a
xin3he add UT to cover conv1d detection
f4fc5f41
xin3he support MXFP4/8 dequantization
4f6f97e4
xin3he Merge branch 'main' into xinhe/4-14
ed46cd68
xin3he fix pylint
7e3a3f87
xin3he Merge branch 'main' into xinhe/4-14
958191a5
xin3he add auto fallback and change class name
7440c321
xin3he
xin3he
azure-pipelines
xin3he xin3he requested a review from n1ck-guo n1ck-guo 56 days ago
xin3he xin3he requested a review from yiliu30 yiliu30 56 days ago
xin3he fix CI
8b8d084e
wenhuach21
wenhuach21 commented on 2026-04-26
wenhuach21
wenhuach21 commented on 2026-04-26
wenhuach21
wenhuach21 commented on 2026-04-26
wenhuach21
wenhuach21 commented on 2026-04-26
wenhuach21
wenhuach21 commented on 2026-04-26
wenhuach21
wenhuach21 commented on 2026-04-26
wenhuach21
wenhuach21 commented on 2026-04-26
wenhuach21
wenhuach21 commented on 2026-04-26
wenhuach21
wenhuach21 commented on 2026-04-26
wenhuach21
wenhuach21 commented on 2026-04-26
wenhuach21
wenhuach21 commented on 2026-04-26
xin3he update readme
eb5fdf43
xin3he 添加回退压缩器功能以支持量化和保存
98a50401
xin3he Merge branch 'main' into xinhe/4-14
46465c39
xin3he
azure-pipelines
xin3he support diffusion model
7c76188a
xin3he fix bug
a92acc2b
xin3he support layer_config={".ffn.experts.": {"scheme": "W2A16"}} usage
46ed32c4
xin3he fix bug
6f41cec5
xin3he
azure-pipelines
xin3he update UT
9f81c67c
xin3he fix bug
16ead43b
xin3he Merge remote-tracking branch 'origin/main' into xinhe/4-14
48994a40
xin3he add model free for new arch
3d9812ca
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
bd318616
xin3he Merge branch 'main' into xinhe/4-14
efd8753a
xin3he
azure-pipelines
xin3he
azure-pipelines
wenhuach21
wenhuach21 approved these changes on 2026-04-30
xin3he fix issue in comments
312eabef
xin3he unify cli content
dbdaf9f1
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
6ba73d61
xin3he update per comments
bba56fe8
xin3he Merge branch 'main' into xinhe/4-14
fca22106
xin3he fix bug
22711c39
xin3he Merge branch 'main' into xinhe/4-14
7e537ef7
xin3he
azure-pipelines
xin3he fix CI
7aac7952
xin3he remove breakpoint
1888f86c
xin3he
azure-pipelines
xin3he xin3he force pushed from f593cfa2 to 551e1ca5 42 days ago
xin3he xin3he force pushed from 551e1ca5 to f9cd4c7d 42 days ago
xin3he add iters in init kwargs for new arch
f9cd4c7d
xin3he
azure-pipelines
xin3he
azure-pipelines
xin3he xin3he merged f0013f09 into main 40 days ago
xin3he xin3he deleted the xinhe/4-14 branch 40 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone