auto-round
Add static FP8 attention support
#1061
Merged

Add static FP8 attention support #1061

yiliu30 merged 35 commits into main from quant-attn
yiliu30
yiliu30 add attention quant
46749f0c
yiliu30 add ut
f743ffba
yiliu30 add llama patch
a81b5145
yiliu30 correct fp8
157f6d13
yiliu30 add utils
586462f8
yiliu30 merge main
591549b2
yiliu30 fix shape
65a467ee
yiliu30 tmp
da1fe7fc
yiliu30 clean code
4f3b0a32
yiliu30 Merge branch 'main' into quant-attn
ae3a4aa7
yiliu30 add ut
ceca38a6
yiliu30 clean
a49c09b7
yiliu30 Merge branch 'quant-attn' of https://github.com/intel/auto-round into…
90bf465f
yiliu30 fix
adc5cb3b
yiliu30 refine
a61bd657
yiliu30 clean
c4bfce03
yiliu30 fix
478eef09
yiliu30 fix
5ed5f724
yiliu30 fix
53f6ae8a
yiliu30 fix
741f818f
yiliu30 fix alias tensor
ae6cec51
yiliu30 fix ut
ffa5ac56
yiliu30 Merge branch 'main' into quant-attn
c7d72d57
yiliu30 Merge branch 'main' into quant-attn
641089df
yiliu30 Merge branch 'main' into quant-attn
61ca489d
yiliu30 update
b698ec43
yiliu30 fix
3b363531
yiliu30 fix
4e541798
yiliu30 Merge branch 'main' into quant-attn
2aaea060
yiliu30 yiliu30 enabled auto-merge (squash) 106 days ago
yiliu30 yiliu30 requested a review from n1ck-guo n1ck-guo 106 days ago
yiliu30 yiliu30 requested a review from wenhuach21 wenhuach21 106 days ago
wenhuach21 wenhuach21 requested a review from xin3he xin3he 106 days ago
wenhuach21
wenhuach21 commented on 2025-11-25
wenhuach21
wenhuach21 commented on 2025-11-25
wenhuach21
wenhuach21 commented on 2025-11-25
wenhuach21
wenhuach21 commented on 2025-11-25
wenhuach21
wenhuach21 commented on 2025-11-25
wenhuach21
wenhuach21 commented on 2025-11-25
yiliu30 upadte
cd7a7070
yiliu30 Merge branch 'main' into quant-attn
2743582a
disabled auto-merge 105 days ago
Manually disabled by user
yiliu30 Merge branch 'quant-attn' of https://github.com/intel/auto-round into…
6d1e0abf
yiliu30 update
cc8d06a2
yiliu30 update
a47a0491
wenhuach21 wenhuach21 requested a review from wenhuach21 wenhuach21 105 days ago
wenhuach21
wenhuach21 approved these changes on 2025-11-25
yiliu30 Merge branch 'main' into quant-attn
54d6f5ab
yiliu30 yiliu30 merged 6bde0e11 into main 105 days ago
yiliu30 yiliu30 deleted the quant-attn branch 105 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone