vllm
950cf9e5
- [Bugfix] Use PIECEWISE cudagraphs on Blackwell if max_model_len > 131072 (#27114)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
128 days ago
[Bugfix] Use PIECEWISE cudagraphs on Blackwell if max_model_len > 131072 (#27114) Signed-off-by: mgoin <mgoin64@gmail.com>
References
#27114 - [Bugfix] Use PIECEWISE cudagraphs on Blackwell if max_model_len > 131072
Author
mgoin
Parents
3125d799
Loading