[Model Runner V2] Disable piecewise cudagraph mode fallback for eagle draft decodes #39773
TheEpicDolphin
marked this pull request as ready for review 26 days ago
TheEpicDolphin
changed the title [Model Runner V2] Disable piecewise cudagraph mode for eagle speculator [Model Runner V2] Disable piecewise cudagraph mode fallback for eagle speculator 26 days ago
TheEpicDolphin
changed the title [Model Runner V2] Disable piecewise cudagraph mode fallback for eagle speculator [WIP][Model Runner V2] Disable piecewise cudagraph mode fallback for eagle speculator 26 days ago
pad last_token_indices to prevent stale values from causing out of er…
56bb1e96
TheEpicDolphin
changed the title [WIP][Model Runner V2] Disable piecewise cudagraph mode fallback for eagle speculator [WIP][Model Runner V2] Disable piecewise cudagraph mode fallback for eagle draft decodes 25 days ago
disable piecewise cg for draft decodes
6febfbad
TheEpicDolphin
changed the title [WIP][Model Runner V2] Disable piecewise cudagraph mode fallback for eagle draft decodes [Model Runner V2] Disable piecewise cudagraph mode fallback for eagle draft decodes 25 days ago
TheEpicDolphin
marked this pull request as ready for review 25 days ago
remove 'decode_mode', not needed
30a1216b
TheEpicDolphin
deleted the gdelfin/mrv2-eagle-disable-piecewise-cg branch 25 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub