vllm
[Perf] add packed recurrent fast path for decode
#36596
Merged

caozuoba
caozuoba fla: add packed recurrent decode fast path
3036a745
caozuoba tests: fix packed recurrent decode reference call
2e7b9407
caozuoba requested a review from sighingnow 62 days ago
caozuoba requested a review from mgoin 62 days ago
caozuoba requested a review from tlrmchlsmth 62 days ago
caozuoba requested a review from WoosukKwon 62 days ago
caozuoba requested a review from yewentao256 62 days ago
mergify added qwen
mergify
gemini-code-assist commented on 2026-03-10
caozuoba style: ruff format
9bdba19d
caozuoba Merge branch 'main' into perf/gdn-packed
cafc0329
caozuoba
ZJY0516 commented on 2026-03-11
caozuoba gdn: address review feedback
297a3f8b
caozuoba
caozuoba
ZJY0516 commented on 2026-03-11
ZJY0516
caozuoba gdn: move decode path routing into forward core
6ba4d35d
ZJY0516 commented on 2026-03-12
caozuoba refactor: inline baseline logic in forward core
1d4dafa7
ZJY0516 approved these changes on 2026-03-12
ywang96 Merge branch 'main' into perf/gdn-packed
d95cfdf3
ywang96 added ready
ywang96 approved these changes on 2026-03-12
caozuoba
caozuoba Merge branch 'main' into perf/gdn-packed
e17186aa
caozuoba
caozuoba
mgoin approved these changes on 2026-03-12
caozuoba
mgoin
vllm-bot merged 9e19f833 into main 59 days ago
