vllm
Reduce the Cuda Graph memory footprint when running with DBO
#25779
Merged

Reduce the Cuda Graph memory footprint when running with DBO #25779

SageMoore
SageMoore init
1fc9de4c
SageMoore init
2f29120a
SageMoore SageMoore requested a review from WoosukKwon WoosukKwon 78 days ago
SageMoore SageMoore requested a review from robertgshaw2-redhat robertgshaw2-redhat 78 days ago
SageMoore SageMoore requested a review from njhill njhill 78 days ago
SageMoore SageMoore requested a review from ywang96 ywang96 78 days ago
SageMoore SageMoore requested a review from comaniac comaniac 78 days ago
SageMoore SageMoore requested a review from alexm-redhat alexm-redhat 78 days ago
mergify mergify added v1
gemini-code-assist
gemini-code-assist commented on 2025-09-26
tlrmchlsmth tlrmchlsmth added this to the v0.11.0 Cherry Picks milestone 78 days ago
tlrmchlsmth tlrmchlsmth added ready
SageMoore Merge branch 'main' of https://github.com/neuralmagic/vllm into sage/…
a4516dc1
ProExpertProg
ProExpertProg approved these changes on 2025-09-26
tlrmchlsmth
tlrmchlsmth approved these changes on 2025-09-26
tlrmchlsmth tlrmchlsmth enabled auto-merge (squash) 78 days ago
tlrmchlsmth tlrmchlsmth merged 4778b426 into main 78 days ago

Login to write a write a comment.

Login via GitHub