vllm
8f121f78 - [Model Runner V2] support auto resolve cudagraph mode/sizes based on attn backend (#32936)

Commit
24 days ago
[Model Runner V2] support auto resolve cudagraph mode/sizes based on attn backend (#32936)

Signed-off-by: zhuhaoran <zhuhaoran.zhr@alibaba-inc.com>