vllm
2f42a488 - [Feature] Support KV cache offloading and disagg prefill with LMCache connector. (#12953)

Commit
169 days ago
[Feature] Support KV cache offloading and disagg prefill with LMCache connector. (#12953)
Author
Parents
  • examples/offline_inference
    • File
      cpu_offload_lmcache.py
    • File
      disaggregated_prefill_lmcache.py
  • vllm/distributed
    • kv_transfer/kv_connector
      • File
        factory.py
      • File
        lmcache_connector.py
    • File
      parallel_state.py