vllm
fix(mxfp4): Disable monolithic path for TRITON backend with EP
#34270
Merged

fix(mxfp4): Disable monolithic path for TRITON backend with EP #34270

elizabetht
elizabetht elizabetht requested a review from mgoin mgoin 17 days ago
elizabetht elizabetht requested a review from tlrmchlsmth tlrmchlsmth 17 days ago
elizabetht elizabetht requested a review from WoosukKwon WoosukKwon 17 days ago
elizabetht elizabetht requested a review from yewentao256 yewentao256 17 days ago
elizabetht elizabetht requested a review from robertgshaw2-redhat robertgshaw2-redhat 17 days ago
elizabetht elizabetht requested a review from pavanimajety pavanimajety 17 days ago
gemini-code-assist
gemini-code-assist commented on 2026-02-10
mgoin
elizabetht fix(mxfp4): Apply expert_map remapping in triton_kernel_moe_forward f…
b8d49386
elizabetht elizabetht force pushed from 6df162e0 to b8d49386 17 days ago
mergify mergify added gpt-oss
elizabetht
elizabetht
tlrmchlsmth
tlrmchlsmth commented on 2026-02-13
varun-sundar-rabindranath
varun-sundar-rabindranath commented on 2026-02-13
varun-sundar-rabindranath
elizabetht Fix EP expert_map indexing dtype
68fc33e6
varun-sundar-rabindranath
varun-sundar-rabindranath commented on 2026-02-17
elizabetht Fix mxfp4 triton EP test to run on CUDA devices
64b77954
elizabetht Add CPU fallback for device selection in EP test
d5f5aead
varun-sundar-rabindranath
varun-sundar-rabindranath approved these changes on 2026-02-25
mgoin mgoin added ready
mgoin
mgoin approved these changes on 2026-02-25
mgoin Merge branch 'main' into fix/mxfp4-triton-ep-crash
cacd2283
vllm-bot vllm-bot merged c97234c0 into main 2 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone