vllm
[Feature]: support serving nvfp4 W4A16 moe models uisng Marlin
#30906
Open

[Feature]: support serving nvfp4 W4A16 moe models uisng Marlin #30906

EdalatiAli wants to merge 6 commits into vllm-project:main from EdalatiAli:main
EdalatiAli
EdalatiAli Add CompressedTensorsW4A16NVfp4MoeMethod to serve nvfp4 W4A16 quantiz…
ade34857
EdalatiAli fix naming
419c1b32
EdalatiAli add EPLB check
8f54548d
EdalatiAli EdalatiAli requested a review from mgoin mgoin 2 days ago
EdalatiAli EdalatiAli requested a review from robertgshaw2-redhat robertgshaw2-redhat 2 days ago
EdalatiAli EdalatiAli requested a review from tlrmchlsmth tlrmchlsmth 2 days ago
EdalatiAli EdalatiAli requested a review from yewentao256 yewentao256 2 days ago
EdalatiAli EdalatiAli requested a review from pavanimajety pavanimajety 2 days ago
EdalatiAli Merge branch 'main' into main
a148e770
mergify
gemini-code-assist
gemini-code-assist commented on 2025-12-17
chatgpt-codex-connector
chatgpt-codex-connector commented on 2025-12-17
EdalatiAli fix method signitures
88494b62
mergify
EdalatiAli align maybe_make_prepare_finalize inputs with the parent
051c97ef
dsikka

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone