vllm
3bfe55a0
- [Model Runner V2] Disable piecewise cudagraph mode fallback for eagle draft decodes (#39773)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
10 days ago
[Model Runner V2] Disable piecewise cudagraph mode fallback for eagle draft decodes (#39773) Signed-off-by: Giancarlo Delfin <gdelfin@inferact.ai>
References
#39773 - [Model Runner V2] Disable piecewise cudagraph mode fallback for eagle draft decodes
Author
TheEpicDolphin
Parents
b569620f
Loading