transformers
🚨 EP: fix EP router contract for many models + honor FP8 scale format
#46818
Merged

vasqu merged 32 commits into main from fix-glm-dsa
IlyasMoutawwakil
IlyasMoutawwakil honor the quant config's scale format and refuse
777c2726
IlyasMoutawwakil changed the title from "FP8: Honor the quant config's scale format" to "FP8: Honor the quant config's scale format and fix EP" 6 days ago
HuggingFaceDocBuilderDev
IlyasMoutawwakil fix fp4 specific
122adb59
IlyasMoutawwakil strict
98034c9d
IlyasMoutawwakil deeper ep fix
f1e2235c
IlyasMoutawwakil test
83625bce
IlyasMoutawwakil style
b34872ee
IlyasMoutawwakil marked this pull request as ready for review 6 days ago
github-actions requested a review from ArthurZucker 6 days ago
github-actions requested a review from Rocketknight1 6 days ago
IlyasMoutawwakil changed the title from "FP8: Honor the quant config's scale format and fix EP" to "EP+FP8: fix EP router contract for many models and honor FP8 scale format" 6 days ago
IlyasMoutawwakil add assertion
0e0550a2
3outeille commented on 2026-06-23
3outeille commented on 2026-06-23
3outeille commented on 2026-06-23
IlyasMoutawwakil more ep plans
4a585fe6
IlyasMoutawwakil fold tp+ep checks and ep assert into one helper
0288a11a
IlyasMoutawwakil style
7d5976d4
IlyasMoutawwakil requested a review from 3outeille 6 days ago
IlyasMoutawwakil raise proper error upon ep request with no ep plan
8ea20bf5
IlyasMoutawwakil Merge branch 'main' into fix-glm-dsa
d288a9e6
vasqu commented on 2026-06-23
IlyasMoutawwakil changed the title from "EP+FP8: fix EP router contract for many models and honor FP8 scale format" to "🚨 EP: fix EP router contract for many models + honor FP8 scale format" 5 days ago
IlyasMoutawwakil address anton's comments and make more modular
d7a2fea9
IlyasMoutawwakil fix repo
99f28f14
vasqu commented on 2026-06-23
IlyasMoutawwakil more modular
b8f8eedd
IlyasMoutawwakil more modular dsv2 topK router
4b04debf
IlyasMoutawwakil commented on 2026-06-24
IlyasMoutawwakil commented on 2026-06-24
IlyasMoutawwakil modular phimoe router
c015bd2d
IlyasMoutawwakil fix
6cf58567
IlyasMoutawwakil commented on 2026-06-24
IlyasMoutawwakil reverting phimoe changes
c8379360
vasqu approved these changes on 2026-06-24
vasqu
IlyasMoutawwakil last modular attempt
9cd6feb3
IlyasMoutawwakil correct fix ?
31f21395
vasqu add BC variation just in case
5c2b2aff
vasqu clearer message
918d6c47
vasqu post init workaround?
c54aee90
ArthurZucker approved these changes on 2026-06-24
vasqu remove the warning
6de8bffd
vasqu Merge branch 'main' into fix-glm-dsa
d1dcde04
vasqu fix CI
54ab3ee2
vasqu ci
8d9846bf
github-actions
vasqu
vasqu fixup modular so we don't need to mess up more attribute maps
dd65d891
vasqu Merge branch 'main' into fix-glm-dsa
15af0007
github-actions
vasqu fix glm4v moe
a5be1ba8
vasqu enabled auto-merge 3 days ago
vasqu fix exaone
d71be264
vasqu merged 8edd87b6 into main 3 days ago
vasqu deleted the fix-glm-dsa branch 3 days ago
vasqu added for patch
