vllm
c50e105a
- [Model Runner V2] Avoid prepare prefill kernel launch overhead (#34780)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 day ago
[Model Runner V2] Avoid prepare prefill kernel launch overhead (#34780) Signed-off-by: Nick Hill <nickhill123@gmail.com>
References
#34780 - [Model Runner V2] Avoid prepare prefill kernel launch overhead
Author
njhill
Parents
a766b303
Loading