vllm
[Perf] Add TRTLLM FP8 MoE Modular Kernel
#36307
Merged

[Perf] Add TRTLLM FP8 MoE Modular Kernel #36307

wzhao18
wzhao18 wzhao18 requested a review from mgoin mgoin 22 days ago
wzhao18 wzhao18 requested a review from pavanimajety pavanimajety 22 days ago
mergify mergify added nvidia
wzhao18 wzhao18 force pushed from 6014cf25 to 4efbb109 22 days ago
gemini-code-assist
gemini-code-assist commented on 2026-03-07
wzhao18
wzhao18 wzhao18 changed the title Add TRTLLM FP8 MoE Modular Kernel [Perf] Add TRTLLM FP8 MoE Modular Kernel 22 days ago
mergify
mergify mergify added needs-rebase
wzhao18 Support trtllm fp8 modular kernel
9b3e8a95
wzhao18 Add base class for trtllm fp8 modular moe
7f0c69d7
wzhao18 Add base class for trtllm fp8 modular moe
20911769
wzhao18 Fix trtllm moe modular monolithic
6c80d32a
wzhao18 fix linting
469b8e5c
wzhao18 Revert changing minimax m2 routing logits dtype
e4545228
wzhao18 wzhao18 force pushed from 4efbb109 to e4545228 18 days ago
mergify mergify removed needs-rebase
mergify
wzhao18 fixup
2c55cd73
wzhao18 wzhao18 requested a review from LucasWilkinson LucasWilkinson 18 days ago
wzhao18 wzhao18 requested a review from MatthewBonanni MatthewBonanni 18 days ago
mergify
wzhao18 fixup
507d3d44
mgoin mgoin added ready
mgoin
wzhao18
wzhao18 Update tests
a3ab5c94
wzhao18 Fix gemini comments
19af1ba1
wzhao18 wzhao18 requested a review from tlrmchlsmth tlrmchlsmth 18 days ago
wzhao18 wzhao18 requested a review from WoosukKwon WoosukKwon 18 days ago
wzhao18 wzhao18 requested a review from yewentao256 yewentao256 18 days ago
mgoin
mgoin approved these changes on 2026-03-12
mgoin Merge branch 'main' into wzhao/fp8-trtllm-modular-moe
451172ac
vllm-bot vllm-bot merged 2e693f48 into main 16 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone