llama.cpp
server: add presets (config) when using multiple models
#17859
Merged

server: add presets (config) when using multiple models #17859

ServeurpersoCom
ServeurpersoCom ServeurpersoCom requested a review from ngxson ngxson 48 days ago
ServeurpersoCom ServeurpersoCom requested a review from ggerganov ggerganov 48 days ago
github-actions github-actions added examples
github-actions github-actions added server
aldehir
aldehir dismissed these changes on 2025-12-08
ngxson
ngxson commented on 2025-12-08
aldehir
aldehir commented on 2025-12-08
ngxson
ngxson commented on 2025-12-08
aldehir
aldehir
aldehir commented on 2025-12-08
ServeurpersoCom ServeurpersoCom marked this pull request as draft 48 days ago
ServeurpersoCom
ServeurpersoCom ServeurpersoCom marked this pull request as ready for review 48 days ago
ServeurpersoCom
ngxson
ServeurpersoCom
ngxson
ngxson commented on 2025-12-08
ngxson
ngxson commented on 2025-12-08
ngxson
ngxson
ngxson commented on 2025-12-08
ngxson
ngxson commented on 2025-12-08
emjomi
ServeurpersoCom
ServeurpersoCom
ServeurpersoCom
ServeurpersoCom llama-server: recursive GGUF loading
51be1fae
ServeurpersoCom server : router config POC (INI-based per-model settings)
972369e8
ServeurpersoCom server: address review feedback from @aldehir and @ngxson
d564ebf9
ServeurpersoCom server: adopt aldehir's line-oriented PEG parser
193bead2
ServeurpersoCom server: fix CLI/env duplication in child processes
a17f501c
ngxson add common/preset.cpp
31cb86a2
ngxson fix compile
a7c7aca6
ngxson cont
e5c3c471
ngxson allow custom-path models
7b962071
ngxson add falsey check
b8d8ffee
ServeurpersoCom server: fix router model discovery and child process spawning
0734bbe4
ngxson Revert "server: fix router model discovery and child process spawning"
a7baeab4
ngxson clarify about "no-" prefix
a70419c0
ngxson correct render_args() to include binary path
97de3114
ngxson also remove arg LLAMA_ARG_MODELS_PRESET for child
f645e887
ngxson add co-author for ini parser code
6bda0d47
ngxson also set LLAMA_ARG_HOST
035f56ad
ngxson add CHILD_ADDR
f2ad7dc9
ServeurpersoCom Remove dead code
b36b3fe1
ServeurpersoCom ServeurpersoCom force pushed from bf2d94cd to b36b3fe1 45 days ago
ServeurpersoCom
ngxson
ServeurpersoCom
ServeurpersoCom
ngxson
ngxson
ngxson approved these changes on 2025-12-10
ngxson ngxson dismissed their stale review 45 days ago
already approved changes related to PEG
ngxson ngxson changed the title Server: router per model config server: add presets (config) when using multiple models 45 days ago
ngxson ngxson merged f32ca51b into master 45 days ago
mdierolf
ServeurpersoCom
mdierolf
ServeurpersoCom
ngxson
mdierolf
strawberrymelonpanda
mdierolf

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone