vllm
[Model Runner V2] Disable piecewise cudagraph mode fallback for eagle draft decodes
#39773
Merged

TheEpicDolphin
mergify added nvidia
mergify added v1
gemini-code-assist
gemini-code-assist commented on 2026-04-14
TheEpicDolphin force pushed 26 days ago
TheEpicDolphin marked this pull request as ready for review 26 days ago
TheEpicDolphin requested a review from WoosukKwon 26 days ago
TheEpicDolphin requested a review from njhill 26 days ago
TheEpicDolphin changed the title from "[Model Runner V2] Disable piecewise cudagraph mode for eagle speculator" to "[Model Runner V2] Disable piecewise cudagraph mode fallback for eagle speculator" 26 days ago
WoosukKwon added ready
TheEpicDolphin marked this pull request as draft 26 days ago
TheEpicDolphin changed the title from "[Model Runner V2] Disable piecewise cudagraph mode fallback for eagle speculator" to "[WIP][Model Runner V2] Disable piecewise cudagraph mode fallback for eagle speculator" 26 days ago
TheEpicDolphin force pushed 26 days ago
TheEpicDolphin force pushed 26 days ago
TheEpicDolphin pad last_token_indices to prevent stale values from causing out of er…
56bb1e96
TheEpicDolphin force pushed 25 days ago
TheEpicDolphin changed the title from "[WIP][Model Runner V2] Disable piecewise cudagraph mode fallback for eagle speculator" to "[WIP][Model Runner V2] Disable piecewise cudagraph mode fallback for eagle draft decodes" 25 days ago
TheEpicDolphin disable piecewise cg for draft decodes
6febfbad
TheEpicDolphin changed the title from "[WIP][Model Runner V2] Disable piecewise cudagraph mode fallback for eagle draft decodes" to "[Model Runner V2] Disable piecewise cudagraph mode fallback for eagle draft decodes" 25 days ago
TheEpicDolphin marked this pull request as ready for review 25 days ago
WoosukKwon
WoosukKwon approved these changes on 2026-04-14
TheEpicDolphin remove 'decode_mode', not needed
30a1216b
TheEpicDolphin force pushed to 30a1216b 25 days ago
WoosukKwon merged 3bfe55a0 into main 25 days ago
TheEpicDolphin deleted the gdelfin/mrv2-eagle-disable-piecewise-cg branch 25 days ago
