vllm
950cf9e5 - [Bugfix] Use PIECEWISE cudagraphs on Blackwell if max_model_len > 131072 (#27114)

Commit
128 days ago
[Bugfix] Use PIECEWISE cudagraphs on Blackwell if max_model_len > 131072 (#27114) Signed-off-by: mgoin <mgoin64@gmail.com>
Author
Parents
Loading