vllm
[Bugfix] Fix NVFP4 TRTLLM MoE non-gated support; add gsm8k for Nemotron-3-Nano FP8+NVFP4
#34725
Merged

[Bugfix] Fix NVFP4 TRTLLM MoE non-gated support; add gsm8k for Nemotron-3-Nano FP8+NVFP4 #34725

mgoin
mgoin Add GSM8k test for Nemotron-Nano-30B-NvFp4-fi-trtllm
a6557bdc
mergify mergify added nvidia
gemini-code-assist
gemini-code-assist commented on 2026-02-17
mgoin Update to general test
f4889617
mgoin Fix nvfp4 and add fp8 trtllm
b357be6d
mgoin mgoin requested a review from robertgshaw2-redhat robertgshaw2-redhat 82 days ago
mgoin mgoin requested a review from tlrmchlsmth tlrmchlsmth 82 days ago
mgoin mgoin requested a review from yewentao256 yewentao256 82 days ago
mgoin mgoin requested a review from pavanimajety pavanimajety 82 days ago
mgoin mgoin changed the title Add GSM8k test for Nemotron-Nano-30B-NvFp4-fi-trtllm Fix NVFP4 TRTLLM MoE non-gated support; add gsm8k for Nemotron-3-Nano FP8+NVFP4 82 days ago
robertgshaw2-redhat
robertgshaw2-redhat commented on 2026-02-17
robertgshaw2-redhat
robertgshaw2-redhat commented on 2026-02-17
mgoin mgoin added bug
mgoin mgoin added ready
mgoin Fix env
37d5fbcb
robertgshaw2-redhat
robertgshaw2-redhat approved these changes on 2026-02-17
robertgshaw2-redhat robertgshaw2-redhat enabled auto-merge (squash) 81 days ago
mgoin mgoin changed the title Fix NVFP4 TRTLLM MoE non-gated support; add gsm8k for Nemotron-3-Nano FP8+NVFP4 [Bugfix] Fix NVFP4 TRTLLM MoE non-gated support; add gsm8k for Nemotron-3-Nano FP8+NVFP4 81 days ago
vllm-bot vllm-bot merged caeb887b into main 81 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone