vllm
4778b426
- Reduce the Cuda Graph memory footprint when running with DBO (#25779)
Commit
73 days ago
Reduce the Cuda Graph memory footprint when running with DBO (#25779)
Signed-off-by: Sage Moore <sage@neuralmagic.com>
References
#25779 - Reduce the Cuda Graph memory footprint when running with DBO
Author
SageMoore
Parents
c70ac4b8