[Core][v1] Unify allocating slots in prefill and decode in KV cache manager #12608
Add _untouch() to reverse touch() if not enough blocks
cf0cac2b
Combine allocate_slots and append_slots
5cedce5b
Delete append_slots
a482f5db
Modify test case in prefix caching
23772e92
Address static checkers
075f1b5b
comaniac
requested changes
on 2025-01-31
Address reviewer comments
c7ef0037
Remove _untouch
8b2172a0
comaniac
approved these changes
on 2025-02-01
Address comments
b4256766
ShawnD200
deleted the unify-prefill-and-decode branch 1 year ago
Login to write a write a comment.
Login via GitHub