vllm
[Bugfix] FlashInfer MXINT4 MoE crashes, missing do_finalize
#39315
Merged

benchislett
benchislett fix flashinfer mxint4 crash
3067ce5f
benchislett requested a review from mgoin 5 days ago
benchislett requested a review from robertgshaw2-redhat 5 days ago
benchislett requested a review from tlrmchlsmth 5 days ago
benchislett requested a review from yewentao256 5 days ago
benchislett requested a review from pavanimajety 5 days ago
mergify added nvidia
mergify added bug
gemini-code-assist
gemini-code-assist commented on 2026-04-08
mgoin
benchislett add unit test for mxint4 wrapper
ea0a7c4c
benchislett requested a review from WoosukKwon 5 days ago
benchislett change test to cover wrapper only
e2c85417
benchislett Update vllm/model_executor/layers/quantization/utils/flashinfer_mxint…
2f66debf
mgoin
mgoin approved these changes on 2026-04-08
mgoin added ready
benchislett merged 8332078c into main 4 days ago
benchislett deleted the bugfix-mxint4-moe branch 4 days ago

