vllm
Enable prefix caching with full cuda graphs
#19617
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
4
Changes
View On
GitHub
Enable prefix caching with full cuda graphs
#19617
WoosukKwon
merged 4 commits into
main
from
full-cuda-graph-prefix-caching
[Bugfix] Enable prefix caching with full cuda graphs
e8f07e50
minor
b10f335f
gemini-code-assist
commented on 2025-06-13
WoosukKwon
changed the title
Full cuda graph prefix caching
Enable prefix caching with full cuda graphs
218 days ago
WoosukKwon
added
ready
gemini-code-assist
commented on 2025-06-13
mergify
added
needs-rebase
houseroad
commented on 2025-06-14
merge
23b2d387
WoosukKwon
requested a review
from
hmellor
217 days ago
WoosukKwon
requested a review
from
njhill
217 days ago
WoosukKwon
requested a review
from
LiuXiaoxuanPKU
217 days ago
WoosukKwon
requested a review
from
alexm-redhat
217 days ago
WoosukKwon
requested a review
from
comaniac
217 days ago
WoosukKwon
requested a review
from
robertgshaw2-redhat
217 days ago
WoosukKwon
requested a review
from
ywang96
217 days ago
WoosukKwon
requested a review
from
tlrmchlsmth
217 days ago
WoosukKwon
requested a review
from
aarnphm
217 days ago
Merge branch 'main' into full-cuda-graph-prefix-caching
e56aad45
mergify
added
documentation
mergify
added
ci/build
mergify
added
frontend
mergify
added
llama
mergify
added
rocm
mergify
added
structured-output
mergify
added
speculative-decoding
mergify
added
v1
mergify
removed
needs-rebase
ywang96
approved these changes on 2025-06-15
WoosukKwon
enabled auto-merge (squash)
216 days ago
disabled auto-merge
216 days ago
Manually disabled by user
WoosukKwon
merged
055915e6
into main
216 days ago
WoosukKwon
deleted the full-cuda-graph-prefix-caching branch
216 days ago
Login to write a write a comment.
Login via GitHub
Reviewers
ywang96
houseroad
gemini-code-assist
hmellor
njhill
LiuXiaoxuanPKU
alexm-redhat
comaniac
robertgshaw2-redhat
tlrmchlsmth
aarnphm
Assignees
No one assigned
Labels
documentation
rocm
structured-output
frontend
speculative-decoding
ready
ci/build
v1
llama
Milestone
No milestone
Login to write a write a comment.
Login via GitHub