llama.cpp
server: save and clear idle slots on new task (`--clear-idle`)
#20993
Merged

server: save and clear idle slots on new task (`--clear-idle`) #20993

yychyo
ggml-gh-bot
yychyo
yychyo server: clear idle slots KV from VRAM (LLAMA_KV_KEEP_ONLY_ACTIVE)
c82cd357
yychyo yychyo force pushed from a9cde4ea to 231926b4 17 days ago
yychyo yychyo requested a review 17 days ago
yychyo
yychyo commented on 2026-03-26
yychyo
yychyo commented on 2026-03-26
yychyo
yychyo commented on 2026-03-26
github-actions github-actions added examples
github-actions github-actions added python
github-actions github-actions added server
ggerganov
ggerganov
ggerganov commented on 2026-03-26
yychyo server: move idle slot KV clearing to slot release
2563b4ae
yychyo server: add --kv-clear-idle flag, enable by default
ced2476c
yychyo yychyo requested a review 15 days ago
yychyo yychyo changed the title server : add LLAMA_KV_KEEP_ONLY_ACTIVE to clear idle slots' KV server: clear idle slots KV on release (`--kv-clear-idle`) 15 days ago
yychyo
yychyo server: skip clearing last idle slot, clear on launch
f6c4a3d2
yychyo server: test --no-kv-clear-idle flag
698e9a3c
yychyo server: simplify on-release clearing loop
d658a62c
yychyo
yychyo yychyo force pushed from b9a4e1fa to d658a62c 14 days ago
strawberrymelonpanda
yychyo
strawberrymelonpanda
yychyo
strawberrymelonpanda
ggerganov
yychyo server: remove on-release KV clearing, keep launch-only
3d338c9d
yychyo
ggerganov cont : clean-up
09ed3c03
ggerganov ggerganov requested a review from ngxson ngxson 11 days ago
ggerganov
yychyo tests: update log strings after --clear-idle rename
645aebc4
yychyo
strawberrymelonpanda
strawberrymelonpanda
strawberrymelonpanda
yychyo
strawberrymelonpanda
yychyo
ggerganov
ggerganov approved these changes on 2026-04-01
ggerganov
ggerganov commented on 2026-04-01
yychyo yychyo changed the title server: clear idle slots KV on release (`--kv-clear-idle`) server: save and clear idle slots on new task (--clear-idle) 11 days ago
yychyo yychyo changed the title server: save and clear idle slots on new task (--clear-idle) server: save and clear idle slots on new task (`--clear-idle`) 11 days ago
yychyo tests: use debug tags instead of log message matching
a22ba243
yychyo test: fix Windows CI by dropping temp log file unlink
5fc16981
ggerganov
ggerganov approved these changes on 2026-04-03
ggerganov ggerganov added merge ready
pwilkin
pwilkin approved these changes on 2026-04-03
ngxson
ngxson approved these changes on 2026-04-03
ngxson ngxson merged 50e0ad08 into master 8 days ago
Farmadupe
yychyo
ggerganov

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone