vllm
8f121f78
- [Model Runner V2] support auto resolve cudagraph mode/sizes based on attn backend (#32936)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
24 days ago
[Model Runner V2] support auto resolve cudagraph mode/sizes based on attn backend (#32936) Signed-off-by: zhuhaoran <zhuhaoran.zhr@alibaba-inc.com>
References
#32936 - [Model Runner V2] support auto resolve cudagraph mode/sizes based on attn backend
Author
izhuhaoran
Parents
cb5f7501
Loading