vllm
[Bugfix] Fix Fp8 Triton for non-gated MoE (Nemotron)
#31983
Open

[Bugfix] Fix Fp8 Triton for non-gated MoE (Nemotron) #31983

danisereb wants to merge 2 commits into vllm-project:main from danisereb:fix_nemo_bug
danisereb
gemini-code-assist
gemini-code-assist commented on 2026-01-08
danisereb danisereb marked this pull request as ready for review 5 days ago
danisereb danisereb requested a review from mgoin mgoin 5 days ago
danisereb danisereb requested a review from robertgshaw2-redhat robertgshaw2-redhat 5 days ago
danisereb danisereb requested a review from tlrmchlsmth tlrmchlsmth 5 days ago
danisereb danisereb requested a review from yewentao256 yewentao256 5 days ago
danisereb danisereb requested a review from pavanimajety pavanimajety 5 days ago
robertgshaw2-redhat robertgshaw2-redhat changed the title Fix ModelOptFp8MoEMethod for non-gated MoE (Nemotron) [Bugfix] Fix ModelOptFp8MoEMethod for non-gated MoE (Nemotron) 5 days ago
robertgshaw2-redhat robertgshaw2-redhat changed the title [Bugfix] Fix ModelOptFp8MoEMethod for non-gated MoE (Nemotron) [Bugfix] Fix Fp8 Triton for non-gated MoE (Nemotron) 5 days ago
robertgshaw2-redhat
LucasWilkinson
LucasWilkinson dismissed these changes on 2026-01-10
LucasWilkinson LucasWilkinson dismissed their stale review 4 days ago
missed Rob's comment
mergify
danisereb
danisereb danisereb force pushed from ed924368 to 4533a693 2 days ago
danisereb danisereb force pushed from 4533a693 to 85de95e2 2 days ago
mergify
cursor
cursor commented on 2026-01-11
danisereb danisereb force pushed from 85de95e2 to 360a708c 2 days ago
mergify mergify added bug
danisereb Fix ModelOptFp8MoEMethod for non-gated MoE
5e51a9d1
danisereb Add Nemotron Nano V3 BF16 to moe-refactor evals
7d082808
danisereb danisereb force pushed from 01106194 to 7d082808 3 hours ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone