vllm
[NemotronH] Do not force router to run in fp32
#34582
Merged

[NemotronH] Do not force router to run in fp32 #34582

roikoren755
roikoren755 roikoren755 requested a review from mgoin mgoin 83 days ago
roikoren755 roikoren755 requested a review from pavanimajety pavanimajety 83 days ago
roikoren755 Do not force NemotronH router to fp32
608fbdf3
roikoren755 Fix router logits dtype mismatch
48ab68e2
roikoren755 roikoren755 force pushed to 48ab68e2 83 days ago
mergify mergify added nvidia
gemini-code-assist
gemini-code-assist commented on 2026-02-15
tomeras91
tomeras91 commented on 2026-02-15
roikoren755 Delete debugging print
8301f2cd
roikoren755 Remove unnecessary .to call
7e58b855
mgoin
mgoin approved these changes on 2026-02-16
mgoin mgoin added performance
mgoin mgoin added ready
roikoren755
vllm-bot vllm-bot merged 3b30e615 into main 82 days ago
roikoren755 roikoren755 deleted the feat/nemotronh-bf16-router branch 81 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone