vllm
[MISC] Add prefix cache hit rate to metrics
#7606
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
1
Changes
View On
GitHub
Hide Comment Changes
Previous Change (CTRL+↑)
Next Change (CTRL+↓)
Expand Context Lines
Collapse Context Lines
Hide Minimap (CTRL+M)
Files
16
Threads
tests
core/block
test_prefix_caching_block.py
prefix_caching
test_prefix_caching.py
vllm
core
block
common.py
cpu_gpu_block_allocator.py
interfaces.py
naive_block.py
prefix_caching_block.py
block_manager_v1.py
block_manager_v2.py
embedding_model_block_manager.py
evictor_v2.py
interfaces.py
scheduler.py
engine
llm_engine.py
metrics.py
metrics_types.py
Loading comments...
Loading