vllm
ffb3d553
- [Model Runner V2] Init cuda graph pool when necessary (#33217)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 day ago
[Model Runner V2] Init cuda graph pool when necessary (#33217) Signed-off-by: Xinyu Chen <xinyu1.chen@intel.com>
References
#33217 - [Model Runner V2] Init cuda graph pool when necessary
Author
xinyu-intel
Parents
fa7e0bfa
Loading