vllm
[Core][v1] Unify allocating slots in prefill and decode in KV cache manager
#12608
Merged

[Core][v1] Unify allocating slots in prefill and decode in KV cache manager #12608

ShawnD200
ShawnD200 Add _untouch() to reverse touch() if not enough blocks
cf0cac2b
ShawnD200 Combine allocate_slots and append_slots
5cedce5b
ShawnD200 Delete append_slots
a482f5db
ShawnD200 Modify test case in prefix caching
23772e92
ShawnD200 Address static checkers
075f1b5b
ShawnD200 ShawnD200 requested a review from WoosukKwon WoosukKwon 1 year ago
ShawnD200 ShawnD200 requested a review from robertgshaw2-redhat robertgshaw2-redhat 1 year ago
ShawnD200 ShawnD200 requested a review from njhill njhill 1 year ago
ShawnD200 ShawnD200 requested a review from ywang96 ywang96 1 year ago
ShawnD200 ShawnD200 requested a review from comaniac comaniac 1 year ago
ShawnD200 ShawnD200 requested a review from alexm-redhat alexm-redhat 1 year ago
github-actions
comaniac comaniac assigned comaniac comaniac 1 year ago
comaniac
comaniac requested changes on 2025-01-31
ShawnD200 Address reviewer comments
c7ef0037
ShawnD200 Remove _untouch
8b2172a0
ShawnD200
comaniac
comaniac approved these changes on 2025-02-01
mergify mergify added v1
ShawnD200 Address comments
b4256766
ShawnD200
comaniac comaniac added ready
comaniac
DarkLight1337 DarkLight1337 merged f8ece6e1 into main 1 year ago
ShawnD200 ShawnD200 deleted the unify-prefill-and-decode branch 1 year ago

Login to write a write a comment.

Login via GitHub

Assignees
Labels
Milestone