🚨 EP: fix EP router contract for many models + honor FP8 scale format #46818
honor the quant config's scale format and refuse
777c2726
IlyasMoutawwakil
changed the title FP8: Honor the quant config's scale format FP8: Honor the quant config's scale format and fix EP 6 days ago
fix fp4 specific
122adb59
strict
98034c9d
deeper ep fix
f1e2235c
test
83625bce
style
b34872ee
IlyasMoutawwakil
changed the title FP8: Honor the quant config's scale format and fix EP EP+FP8: fix EP router contract for many models and honor FP8 scale format 6 days ago
add assertion
0e0550a2
more ep plans
4a585fe6
fold tp+ep checks and ep assert into one helper
0288a11a
style
7d5976d4
rasie propper error upon ep request with no ep plan
8ea20bf5
Merge branch 'main' into fix-glm-dsa
d288a9e6
vasqu
commented
on 2026-06-23
IlyasMoutawwakil
changed the title EP+FP8: fix EP router contract for many models and honor FP8 scale format 🚨 EP: fix EP router contract for many models + honor FP8 scale format 5 days ago
address anton's comments and make more modular
d7a2fea9
fix repo
99f28f14
vasqu
commented
on 2026-06-23
more modular
b8f8eedd
more modular dsv2 topK router
4b04debf
modular phimoe router
c015bd2d
fix
6cf58567
reverting phimoe changes
c8379360
vasqu
approved these changes
on 2026-06-24
last modular attempt
9cd6feb3
correct fix ?
31f21395
add BC variation just in case
5c2b2aff
clearer message
918d6c47
post init workaround?
c54aee90
remove the warning
6de8bffd
Merge branch 'main' into fix-glm-dsa
d1dcde04
fix CI
54ab3ee2
ci
8d9846bf
fixup modular so we don't need to mess up more attribute maps
dd65d891
Merge branch 'main' into fix-glm-dsa
15af0007
fix glm4v moe
a5be1ba8
vasqu
enabled auto-merge 3 days ago
fix exaone
d71be264
vasqu
merged
8edd87b6
into main 3 days ago
vasqu
deleted the fix-glm-dsa branch 3 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub