vllm
5f1af97f
- [V1] [Hybrid] Enable Full CUDA graph by default for hybrid models in V1 (#22594)
Commit
104 days ago
[V1] [Hybrid] Enable Full CUDA graph by default for hybrid models in V1 (#22594)

Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
References
#22594 - [V1] [Hybrid] Enable Full CUDA graph by default for hybrid models in V1
Author
tdoublep
Parents
c3b0fd1e