vllm
8b141ed8
- full cudagraph for flex-attn (#36298)
Commit
17 days ago
full cudagraph for flex-attn (#36298)

Signed-off-by: shunting314 <shunting@meta.com>
References
#36298 - full cudagraph for flex-attn
Author
shunting314
Parents
2ad7c033