vllm
c0c2dd1e - [BugFix] kv_offloading: Fix bug in loading of partial cpu blocks (#28951)

Commit
154 days ago
[BugFix] kv_offloading: Fix bug in loading of partial cpu blocks (#28951)

Signed-off-by: Or Ozeri <oro@il.ibm.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>