llama.cpp
context : reserve new scheduler when graph topology changes
#18547
Open

context : reserve new scheduler when graph topology changes #18547

ggerganov wants to merge 11 commits into master from gg/llama-reserve
ggerganov
danbev
danbev approved these changes on 2026-01-02
Base automatically changed from gg/metal-adjust-fa-extra-size to master 12 days ago
ggerganov ggerganov force pushed from 400466c0 to bd5de6ba 12 days ago
ggerganov ggerganov force pushed from 89d19e00 to c92df391 10 days ago
ggerganov ggerganov force pushed from c92df391 to cf2b3cae 9 days ago
ggerganov ggerganov force pushed from cf2b3cae to 4b744105 3 days ago
ggerganov context : reserve new scheduler when graph topology changes
e115c637
ggerganov cont : fix
7b526420
ggerganov cont : fix reserve
94426b2e
ggerganov cont : reserve only when changes occur + timing
03e9d66c
ggerganov context : add comments
5260bb79
ggerganov llama : reserve on sampler changes
0c0d0fdc
ggerganov common : allow null common_sampler
b579b970
ggerganov ggerganov force pushed from 4b744105 to b579b970 2 days ago
ggerganov ggerganov requested a review from ngxson ngxson 2 days ago
ggerganov
github-actions github-actions added examples
github-actions github-actions added server
ngxson
ngxson commented on 2026-01-12
ggerganov server : task declares needs (embd, logits, sampling)
ffa0d15e
ggerganov server : do not init sampler if not needed
be9e6ef2
ggerganov llama : fix need_reserve when unsetting a sampler
3084bfe6
ggerganov server : consolidate slot reset/clear logic
d9146ed2

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone