auto-round
support quant lm_head for rtn w8afp8 static quant
#754
Merged


n1ck-guo merged 13 commits into main from hengguo/w8afp8
n1ck-guo support quant lm_head for rtn w8afp8 static quant
b403307e
n1ck-guo requested a review from yiliu30 306 days ago
n1ck-guo requested a review from wenhuach21 306 days ago
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
2198bc8e
yiliu30 commented on 2025-08-22
n1ck-guo Merge branch 'main' into hengguo/w8afp8
250e4346
n1ck-guo add doc for infer bits
98cc7a0c
yiliu30 requested a review from yiliu30 306 days ago
n1ck-guo update
6ec49b26
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
67c38f80
yiliu30 commented on 2025-08-22
yiliu30 approved these changes on 2025-08-22
n1ck-guo fix ut
338db04f
n1ck-guo fix
d4bdbf31
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
14ca4880
n1ck-guo fix
dcdae50b
n1ck-guo merge
171516fc
n1ck-guo Merge branch 'main' into hengguo/w8afp8
66ff19ab
n1ck-guo fix
8d9d4b5f
n1ck-guo merged de7ecc1b into main 303 days ago
n1ck-guo deleted the hengguo/w8afp8 branch 303 days ago

