vllm
be0c855e
- [KV Offload] Unified memory layout for offloading workers (#37206)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
13 days ago
[KV Offload] Unified memory layout for offloading workers (#37206) Signed-off-by: omerpaz95 <omerpaz95@gmail.com> Co-authored-by: Or Ozeri <oro@il.ibm.com>
References
#37206 - [KV Offload] Unified memory layout for offloading workers
Author
omerpaz95
Parents
e64b39ea
Loading