vllm
3bfe55a0 - [Model Runner V2] Disable piecewise cudagraph mode fallback for eagle draft decodes (#39773)

Commit
10 days ago
[Model Runner V2] Disable piecewise cudagraph mode fallback for eagle draft decodes (#39773)

Signed-off-by: Giancarlo Delfin <gdelfin@inferact.ai>