vllm
2f42a488
- [Feature] Support KV cache offloading and disagg prefill with LMCache connector. (#12953)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Hide Minimap (CTRL+M)
Commit
169 days ago
[Feature] Support KV cache offloading and disagg prefill with LMCache connector. (#12953)
References
#12953 - [Feature] Support KV cache offloading and disagg prefill with LMCache connector.
Author
YaoJiayi
Parents
3173c3b3
Files
5
examples/offline_inference
cpu_offload_lmcache.py
disaggregated_prefill_lmcache.py
vllm/distributed
kv_transfer/kv_connector
factory.py
lmcache_connector.py
parallel_state.py
Loading