fix(mxfp4): Disable monolithic path for TRITON backend with EP #34270
fix(mxfp4): Apply expert_map remapping in triton_kernel_moe_forward f…
b8d49386
elizabetht force-pushed from 6df162e0 to b8d49386 17 days ago
Fix EP expert_map indexing dtype
68fc33e6
Fix mxfp4 triton EP test to run on CUDA devices
64b77954
Add CPU fallback for device selection in EP test
d5f5aead
mgoin approved these changes on 2026-02-25
Merge branch 'main' into fix/mxfp4-triton-ep-crash
cacd2283
vllm-bot merged c97234c0 into main 2 days ago