vllm
1f5d178e
- Revert "[Bugfix] default set cuda_graph_sizes to max_num_seqs for v1 engine" (#20128)
Commit
225 days ago
Revert "[Bugfix] default set cuda_graph_sizes to max_num_seqs for v1 engine" (#20128)
References
#20128 - Revert "[Bugfix] default set cuda_graph_sizes to max_num_seqs for v1 engine"
Author
mgoin
Parents
27c065df