auto-round
Enable q_scale calibration for deepseek v2 on Gaudi
#1299
Merged

Enable q_scale calibration for deepseek v2 on Gaudi #1299

mengniwang95 merged 25 commits into main from mengni/fp8_sdpa
mengniwang95
mengniwang95 Enable q_scale for deepseek and fix kv_cache_scheme
0065ee58
mengniwang95 skip complex tensor on hpu
ada8a093
mengniwang95 bug fix
81b2a5c7
mengniwang95 fix bug
c3080998
mengniwang95 Merge branch 'main' into mengni/fp8_sdpa
50ffe0de
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
1b67ad09
mengniwang95 mengniwang95 requested a review from yiliu30 yiliu30 51 days ago
mengniwang95 mengniwang95 requested a review from n1ck-guo n1ck-guo 51 days ago
mengniwang95 fix CI
ac1837d3
yiliu30
yiliu30 commented on 2026-01-20
mengniwang95 fix conditional statements
096d5b93
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
5d926393
yiliu30 yiliu30 added hpu
yiliu30
yiliu30 commented on 2026-01-20
mengniwang95 fix replace on cuda
30f77c54
mengniwang95 use original kv
b9390db8
mengniwang95
mengniwang95 inherit replace module from the original module
2c6f76b5
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
0f4c7990
mengniwang95 fix CI
486054d5
mengniwang95 refine code
9ce75f8b
mengniwang95 add ut
fa9f8aa6
mengniwang95 clean code
513a0196
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
5fa153df
mengniwang95 fix
903dbcc1
mengniwang95 Merge branch 'main' into mengni/fp8_sdpa
9b61c5e3
mengniwang95 add hpu ut req
5828ecb4
mengniwang95
yiliu30
mengniwang95
mengniwang95 Revert "use original kv"
e64dd1b9
mengniwang95 fix CI
136e26f6
yiliu30
yiliu30 approved these changes on 2026-01-21
mengniwang95 Update deepseek_v2.py
61755260
mengniwang95 Update deepseek_v2.py
3c46d984
mengniwang95 mengniwang95 merged d8ed35a8 into main 49 days ago
mengniwang95 mengniwang95 deleted the mengni/fp8_sdpa branch 49 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone