auto-round
Support mxfp nvfp lmhead quant
#1051
Merged

Support mxfp nvfp lmhead quant #1051

WeiweiZhang1 merged 24 commits into main from support_mxfp_nvfp_lmhead_quant
WeiweiZhang1
WeiweiZhang1 fp8 exporting bugfix
719e5abb
WeiweiZhang1 Merge branch 'main' of https://github.com/intel/auto-round into main
8e8b04f6
WeiweiZhang1 Merge branch 'main' of https://github.com/intel/auto-round into main
57842a16
WeiweiZhang1 refine exllama backend cuda UT
c2daa790
WeiweiZhang1 Merge branch 'main' of https://github.com/intel/auto-round into main
ca36a70e
WeiweiZhang1 Merge branch 'main' of https://github.com/intel/auto-round into main
9ab08431
WeiweiZhang1 add lm_head layer act_max hook, enable mxfp/nvfp lm_head export
8176d2ed
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
fbba8a6a
WeiweiZhang1 fixtypo
4d097f8b
WeiweiZhang1 fixtypo
024dfc0b
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
925f038f
WeiweiZhang1 fix ut typo
d7681f91
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
7c2c255b
WeiweiZhang1 WeiweiZhang1 requested a review from wenhuach21 wenhuach21 36 days ago
wenhuach21
wenhuach21 commented on 2025-11-21
wenhuach21
wenhuach21 commented on 2025-11-24
wenhuach21
wenhuach21 commented on 2025-11-24
xin3he
xin3he approved these changes on 2025-11-25
WeiweiZhang1 Merge branch 'support_mxfp_nvfp_lmhead_quant' of https://github.com/i…
5c921613
WeiweiZhang1 refine logs, fix pack_layer for awq&gptq
a35f8040
WeiweiZhang1 Merge branch 'main' of https://github.com/intel/auto-round into suppo…
3b3b666e
WeiweiZhang1 refine log, fix pack_layer for awq&gptq
cc780961
WeiweiZhang1 Merge branch 'support_mxfp_nvfp_lmhead_quant' of https://github.com/i…
d461adf8
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
a6b914bd
WeiweiZhang1 fixtypo
4d807c04
wenhuach21
wenhuach21 commented on 2025-11-27
WeiweiZhang1 add awq&gptq lm_head UT
17c71f76
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
f96274d2
WeiweiZhang1 fix local path
31a30c7d
WeiweiZhang1 Merge branch 'support_mxfp_nvfp_lmhead_quant' of https://github.com/i…
6f8f4a50
WeiweiZhang1 WeiweiZhang1 merged c4a14799 into main 29 days ago
WeiweiZhang1 WeiweiZhang1 deleted the support_mxfp_nvfp_lmhead_quant branch 29 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone