vllm
[Bugfix] Disable RoutingMethodType.[Renormalize,RenormalizeNaive] for TRTLLM per-tensor FP8 MoE
#33620
Merged

[Bugfix] Disable RoutingMethodType.[Renormalize,RenormalizeNaive] for TRTLLM per-tensor FP8 MoE #33620

mgoin
mgoin Disable RoutingMethodType.[Renormalize,RenormalizeNaive] TRTLLM
cc10a080
mgoin mgoin requested a review from pavanimajety pavanimajety 98 days ago
mergify mergify added nvidia
mergify mergify added bug
gemini-code-assist
gemini-code-assist commented on 2026-02-03
mgoin mgoin added this to the v0.15.1 Hotfix milestone 98 days ago
robertgshaw2-redhat
robertgshaw2-redhat approved these changes on 2026-02-03
robertgshaw2-redhat robertgshaw2-redhat enabled auto-merge (squash) 98 days ago
github-actions github-actions added ready
mgoin mgoin changed the title [Bugfix] Disable RoutingMethodType.[Renormalize,RenormalizeNaive] TRTLLM per-tensor FP8 MoE [Bugfix] Disable RoutingMethodType.[Renormalize,RenormalizeNaive] for TRTLLM per-tensor FP8 MoE 97 days ago
robertgshaw2-redhat robertgshaw2-redhat merged e346e2d0 into main 97 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone