vllm
ec38a736
- [Model Runner V2] Use packed mask for prompt bin counts (#29756)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
32 days ago
[Model Runner V2] Use packed mask for prompt bin counts (#29756) Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
References
#29756 - [Model Runner V2] Use packed mask for prompt bin counts
Author
WoosukKwon
Parents
21c26279
Loading