vllm
[Feat] Support non-gated activations in NVFP4 modelopt path
#29004
Merged

[Feat] Support non-gated activations in NVFP4 modelopt path #29004

omera-nv
omera-nv omera-nv requested a review from mgoin mgoin 87 days ago
omera-nv omera-nv requested a review from tlrmchlsmth tlrmchlsmth 87 days ago
omera-nv omera-nv requested a review from WoosukKwon WoosukKwon 87 days ago
omera-nv omera-nv requested a review from yewentao256 yewentao256 87 days ago
omera-nv omera-nv requested a review from robertgshaw2-redhat robertgshaw2-redhat 87 days ago
omera-nv omera-nv requested a review from pavanimajety pavanimajety 87 days ago
mergify mergify added nvidia
gemini-code-assist
gemini-code-assist commented on 2025-11-19
chatgpt-codex-connector
chatgpt-codex-connector commented on 2025-11-19
tlrmchlsmth
tlrmchlsmth commented on 2025-11-19
omera-nv omera-nv force pushed 87 days ago
omera-nv omera-nv requested a review from tlrmchlsmth tlrmchlsmth 87 days ago
mergify
mergify mergify added needs-rebase
omera-nv omera-nv force pushed 82 days ago
mergify mergify removed needs-rebase
omera-nv omera-nv force pushed 82 days ago
tlrmchlsmth
tlrmchlsmth approved these changes on 2025-11-25
tlrmchlsmth
mergify
mergify mergify added needs-rebase
omera-nv omera-nv force pushed to 1aa30085 80 days ago
mergify mergify removed needs-rebase
mgoin mgoin added quantization
mgoin mgoin added ready
omera-nv modelopt nvfp4 silu2 ungated weight loading
145f459b
omera-nv fix flashinfer moe test
e95d5fe9
omera-nv test multiple activations
4aa6fa99
omera-nv fix padding in swizzling
64b40c5e
omera-nv assert cutlass backend
1c9900a8
omera-nv Clarify support message
6fd88aef
omera-nv Fix assertion message
c06056f0
omera-nv param[expert_id] -> expert_data
31ecb633
Naveassaf Naveassaf force pushed from 1aa30085 to 31ecb633 76 days ago
mgoin
mgoin approved these changes on 2025-11-30
mgoin mgoin merged 39d28108 into main 76 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone