vllm
Fix/resupport nongated fused moe triton
#36412
Merged

shaunkotek marked this pull request as ready for review 62 days ago
shaunkotek requested a review from mgoin 62 days ago
shaunkotek requested a review from pavanimajety 62 days ago
gemini-code-assist commented on 2026-03-08
mgoin approved these changes on 2026-03-08
mgoin added ready
mgoin added nvidia
shaunkotek resupport non gated fused moe in triton
c1126dba
nvnbagrov [Model] Nano Nemotron VL - fast media preprocessing (#35657)
bb679b1a
sagearc [Frontend] Add GPU-less render serving path (`vllm launch render`) (#…
570bf720
danisereb Add support for ModelOpt MXFP8 MoE models (#35986)
730745c2
ZJY0516 [cudagraph] fix cudagraph warning in deepseekv32 (#28044)
5a47a952
jikunshang [XPU][Doc] update xpu document about triton dependency/conflict issue…
d6ec74d7
hmellor Allow `markdownlint` to run locally (#36398)
ad818e78
yewentao256 [Dependency] Remove default ray dependency (#36170)
6a392273
weiguangli-io [Bugfix] Fix CPU OMP autobind assertion to use local_world_size (#35815)
6cefd2bb
noooop [Examples][1/n] Resettle basic examples. (#35579)
6ffbf5c6
shaunkotek fix: Use iterator as not to store all the file loads in memory at onc…
fa16442a
alex-jw-brooks Increase Flexibility for OOV Multimodal Token Handling (#34858)
1bca5351
DarkLight1337 [Misc] Move processors to `transformers_utils` (#35953)
c7ebbe70
cong-or feat(attention): extract KV-cache update from FlexAttention backend (…
4fdaf0ad
tusharshetty61 [Bugfix] Skip out-of-stage layers in get_layers_from_vllm_config for …
18c8fb96
noooop [Frontend][2/n] Improve pooling entrypoints | embed. (#36110)
cda0a73c
bigPYJ1151 [Bugfix] Avoid to replace non-tensor members in cpu model runner (#36…
b3b43160
alex-jw-brooks [Frontend] Add Support for MM Encoder/Decoder Beam Search (Online Tra…
97c8f800
zhenwei-intel [XPU] Add test script of PD disaggregation (#36434)
a6291dfc
xyang16 [Kernel] Add fused_sigmoid_gating_delta_rule_update kernel for Qwen3 …
2bfb5759
DarkLight1337 [Deprecation][1/2] Remove items deprecated in v0.18 (#36470)
aae9f536
khluu [ci] Bound openai dependency to 2.24.0 (#36471)
9bbc5e63
Isotr0py [MM Encoder] Default to use TORCH_SDPA backend for ViT on Volta/Turin…
e4b4a459
shaunkotek force pushed to e4b4a459 61 days ago
shaunkotek requested a review from noooop 61 days ago
shaunkotek requested a review from tjtanaa 61 days ago
shaunkotek requested a review from patrickvonplaten 61 days ago
shaunkotek requested a review from sighingnow 61 days ago
shaunkotek requested a review from bigPYJ1151 61 days ago
shaunkotek requested a review from hmellor 61 days ago
shaunkotek requested a review from ApostaC 61 days ago
shaunkotek requested a review from orozery 61 days ago
shaunkotek requested a review from tlrmchlsmth 61 days ago
shaunkotek requested a review from WoosukKwon 61 days ago
shaunkotek requested a review from yewentao256 61 days ago
shaunkotek requested a review from DarkLight1337 61 days ago
shaunkotek requested a review from robertgshaw2-redhat 61 days ago
shaunkotek requested a review from aarnphm 61 days ago
shaunkotek requested a review from NickLucche 61 days ago
shaunkotek requested a review from njhill 61 days ago
shaunkotek requested a review from LucasWilkinson 61 days ago
shaunkotek requested a review from MatthewBonanni 61 days ago
shaunkotek requested a review from chaunceyjiang 61 days ago
shaunkotek requested a review from russellb 61 days ago
shaunkotek requested a review from youkaichao 61 days ago
shaunkotek requested a review from houseroad 61 days ago
shaunkotek requested a review from ProExpertProg 61 days ago
shaunkotek requested a review from ywang96 61 days ago
shaunkotek requested a review from 22quinn 61 days ago
shaunkotek requested a review from jeejeelee 61 days ago
shaunkotek requested a review from zou3519 61 days ago
shaunkotek requested a review from BoyuanFeng 61 days ago
mergify added documentation
mergify added ci/build
mergify added frontend
mergify added multi-modality
mergify added performance
mergify added qwen
mergify added rocm
mergify added cpu
mergify added speculative-decoding
mergify added v1
mergify added kv-connector
shaunkotek Merge branch 'main' into fix/resupport-nongated-fused-moe-triton
3614f5ad
vllm-bot merged fa028207 into main 61 days ago
