vllm
Reduce the Cuda Graph memory footprint when running with DBO
#25779

Merged

Reduce the Cuda Graph memory footprint when running with DBO #25779

tlrmchlsmth merged 3 commits into vllm-project:main from neuralmagic:sage/dbo-cudagraph-size

init

1fc9de4c

init

2f29120a

SageMoore requested a review from

WoosukKwon 78 days ago

SageMoore requested a review from

robertgshaw2-redhat 78 days ago

SageMoore requested a review from

njhill 78 days ago

SageMoore requested a review from

ywang96 78 days ago

SageMoore requested a review from

comaniac 78 days ago

SageMoore requested a review from

alexm-redhat 78 days ago

mergify added v1

gemini-code-assist commented on 2025-09-26

tlrmchlsmth added this to the v0.11.0 Cherry Picks milestone 78 days ago

tlrmchlsmth added ready

Merge branch 'main' of https://github.com/neuralmagic/vllm into sage/…

a4516dc1

ProExpertProg approved these changes on 2025-09-26

tlrmchlsmth approved these changes on 2025-09-26

tlrmchlsmth enabled auto-merge (squash) 78 days ago

tlrmchlsmth merged 4778b426 into main 78 days ago

Reviewers

tlrmchlsmth

ProExpertProg

gemini-code-assist

WoosukKwon

robertgshaw2-redhat

njhill

ywang96

comaniac

alexm-redhat

Assignees

No one assigned

Labels

ready v1

Milestone

v0.11.0 Cherry Picks