llama.cpp
Add preemptive priority scheduling
#22851
Closed


lowlyocean wants to merge 10 commits into ggml-org:master from lowlyocean:unload_priority
lowlyocean Remove sleeping before adding priority scheduling
0d99e064
lowlyocean Add preemptive priority scheduling to router mode
61da93e0
lowlyocean Fix new methods declared inside unload_lru() instead of as standalone…
f481f684
lowlyocean Fix bad_function_call bug
087eb36a
lowlyocean OK- priority is respected
653a0b31
lowlyocean Add --models-idle-timeout to replace old --sleep-idle-seconds flag
9349beb1
lowlyocean changed the title from "Feature Request: [Router mode] Preemptive Priority scheduling" to "Add preemptive priority scheduling" 40 days ago
github-actions added the labels server/webui, examples, python, server
lowlyocean Do not reset idle timer for /metrics, /models, /props, /health
68ad4509
lowlyocean Cleanup logs and documentation
a359e8eb
lowlyocean Merge branch 'master' into unload_priority
f401f4de
lowlyocean marked this pull request as ready for review 37 days ago
lowlyocean requested three reviews 37 days ago
lowlyocean Merge branch 'master' into unload_priority
46f63a22
ngxson closed this 36 days ago


Reviewers
No reviews
Assignees
No one assigned