vllm
090f485a
- add support for cutlass mla full cudagraphs
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
265 days ago
add support for cutlass mla full cudagraphs Signed-off-by: Sage Moore <sage@neuralmagic.com>
References
#23693 - [Core/DBO][1/N] Add Dual-Batch Overlap mechanism to VLLM
Author
SageMoore
Committer
SageMoore
Parents
6d76bd03
Loading