auto-round
Add G2-specific `FP8_STATIC` support
#1148
Merged

Add G2-specific `FP8_STATIC` support #1148

yiliu30 merged 35 commits into main from quant-attn-hpu-up
yiliu30
yiliu30 add attention quant
46749f0c
yiliu30 add ut
f743ffba
yiliu30 add llama patch
a81b5145
yiliu30 correct fp8
157f6d13
yiliu30 add utils
586462f8
yiliu30 merge main
591549b2
yiliu30 fix shape
65a467ee
yiliu30 enable compile for hpu
e9157967
yiliu30 compile rtn
be5d94ed
yiliu30 add cmd
9d09edec
yiliu30 udapte cmd
2d2c3122
yiliu30 fix atten
cdcc5c4f
yiliu30 fix
a7b6c33a
yiliu30 fix
0a2de215
yiliu30 fix q scale shape
ddc59772
yiliu30 clean max
1c989d90
yiliu30 merge main
76389619
yiliu30 clean
cc24671b
yiliu30 clean
ede9b27e
yiliu30 fix
45592e19
yiliu30 clean
9610c07d
yiliu30 fix
b311d72d
yiliu30 fix
8c5e0271
yiliu30 revert
e9ad45be
yiliu30 Merge branch 'main' into quant-attn-hpu-up
d0c3f949
yiliu30 clean code
5a3954f3
yiliu30 fix
3a75ae52
ClarkChin08 fix
4bfd73a1
yiliu30 clean
f81cad7b
yiliu30 remove test cmd
47bb10de
yiliu30 yiliu30 requested a review from n1ck-guo n1ck-guo 103 days ago
yiliu30 yiliu30 requested a review from wenhuach21 wenhuach21 103 days ago
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
3278bb46
yiliu30 Update base.py
d191bbac
yiliu30 Merge branch 'main' into quant-attn-hpu-up
b93ae29d
yiliu30 Merge branch 'main' into quant-attn-hpu-up
4ac50213
yiliu30 clean
f39b251b
n1ck-guo
n1ck-guo commented on 2025-12-19
n1ck-guo
n1ck-guo approved these changes on 2025-12-19
a32543254
a32543254 commented on 2025-12-19
yiliu30 yiliu30 merged 547d43f8 into main 100 days ago
yiliu30 yiliu30 deleted the quant-attn-hpu-up branch 100 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone