llama.cpp
server: introduce API for serving / loading / unloading multiple models
#17470
Merged

server: introduce API for serving / loading / unloading multiple models #17470

ngxson
ngxson server: add model management and proxy
fc5901a4
ngxson fix compile error
399f536d
ngxson does this fix windows?
abc0ca47
ngxson fix windows build
54b35457
ngxson use subprocess.h, better logging
5423d42a
ngxson add test
0ef3b61e
ngxson fix windows
7c6eb17f
ngxson Merge branch 'master' into xsn/server_model_management_v1_2
919d3f8c
allozaur feat: Model/Router server architecture WIP
55d33a8b
ngxson more stable
b9ebdf61
ngxson fix unsafe pointer
6610724f
ngxson also allow terminate loading model
d0ea9e08
ngxson add is_active()
5805ca79
allozaur refactor: Architecture improvements
8a885768
allozaur Merge remote-tracking branch 'ngxson/xsn/server_model_management_v1_2…
c35dee3b
ngxson tmp apply upstream fix
21614086
ngxson address most problems
5369aaa1
ngxson address thread safety issue
6929c9f4
ngxson address review comment
be25bccd
ngxson add docs (first version)
cd5c6993
ngxson address review comment
a2e912cf
allozaur feat: Improved UX for model information, modality interactions etc
4bf82a10
allozaur chore: update webui build output
cc88f6a7
allozaur Merge remote-tracking branch 'ngxson/xsn/server_model_management_v1_2…
45bf2a49
allozaur refactor: Use only the message data `model` property for displaying m…
049f40df
allozaur chore: update webui build output
c26c3402
ngxson add --models-dir param
032b9ff4
allozaur feat: New Model Selection UX WIP
8b1d9675
allozaur chore: update webui build output
6b7c0a50
allozaur feat: Add auto-mic setting
69503aa5
allozaur feat: Attachments UX improvements
92585c71
ngxson implement LRU
62ee883d
ngxson remove default model path
7cd92907
ngxson better --models-dir
72415588
ngxson add env for args
b0540e8e
ngxson address review comments
525e2746
ngxson fix compile
457fbdac
allozaur refactor: Chat Form Submit component
c274f132
ngxson Merge branch 'master' into xsn/server_model_management_v1_2
f2ca54b2
ngxson ad endpoint docs
d32bbfec
ngxson Merge remote-tracking branch 'webui/allozaur/server_model_management_…
4af1b6cb
allozaur feat: Add copy to clipboard to model name in model info dialog
076eec6d
allozaur feat: Model unavailable UI state for model selector
db8ed5df
allozaur feat: Chat Form Actions UI logic improvements
dc913ec4
allozaur feat: Auto-select model from last assistant response
a39ef24c
allozaur chore: update webui build output
036cc939
allozaur Merge remote-tracking branch 'ngxson/xsn/server_model_management_v1_2…
6282537a
ngxson expose args and exit_code in API
f25bfaba
ngxson add note
7ef6312f
ngxson support extra_args on loading model
f927e21f
ngxson allow reusing args if auto_load
74685f41
ngxson typo docs
f95f9c51
ngxson oai-compat /models endpoint
2e355c7f
ngxson cleaner
5ad594e6
ngxson address review comments
d65be917
allozaur feat: Use `model` property for displaying the `repo/model-name` namin…
1f0cb3ab
allozaur refactor: Attachments data
b7ba13b6
allozaur chore: update webui build output
48dbef17
allozaur refactor: Enum imports
1c214e9a
allozaur feat: Improve Model Selector responsiveness
ef5f9d07
allozaur chore: update webui build output
49c8062d
allozaur refactor: Cleanup
d5a6671b
allozaur refactor: Cleanup
f8ff39c6
allozaur refactor: Formatters
41764b8f
allozaur chore: update webui build output
219fd19e
allozaur refactor: Copy To Clipboard Icon component
e92ce079
allozaur chore: update webui build output
fb5445e9
allozaur refactor: Cleanup
39fb1c2b
allozaur chore: update webui build output
188d3236
allozaur refactor: UI badges
16747dee
allozaur chore: update webui build output
e808f2b2
allozaur Merge remote-tracking branch 'ngxson/xsn/server_model_management_v1_2…
76557cd5
allozaur refactor: Cleanup
13fe8607
allozaur refactor: Cleanup
b2590a7f
allozaur chore: update webui build output
5ef3f990
ngxson add --models-allow-extra-args for security
6ed192b4
ngxson nits
2c6b58f7
ngxson add stdin_file
539cbf00
ngxson Merge branch 'master' into xsn/server_model_management_v1_2
399b39f2
ngxson fix merge
e514b86d
ngxson ngxson requested a review from ggerganov ggerganov 209 days ago
ngxson ngxson requested a review from allozaur allozaur 209 days ago
angt
allozaur Merge remote-tracking branch 'ngxson/xsn/server_model_management_v1_2…
11c26ecf
allozaur fix: Retrieve lost setting after resolving merge conflict
7db3d874
github-actions github-actions added script
github-actions github-actions added testing
github-actions github-actions added examples
github-actions github-actions added python
github-actions github-actions added server
DocShotgun
ngxson
ngxson
ngxson commented on 2025-11-24
allozaur refactor: DatabaseStore -> DatabaseService
ccd6c271
allozaur refactor: Database, Conversations & Chat services + stores architectu…
fed6c82e
allozaur refactor: Remove redundant settings
f9c911d0
allozaur refactor: Multi-model business logic WIP
501badc9
allozaur chore: update webui build output
4c24ead8
allozaur feat: Switching models logic for ChatForm or when regenerating messge…
b9a3129d
allozaur chore: update webui build output
01324493
allozaur fix: Add `untrack` inside chat processing info data logic to prevent …
82975a1f
allozaur fix: Regenerate
33356f36
allozaur feat: Remove redundant settigns + rearrange
c680083c
allozaur fix: Audio attachments
5207527e
allozaur refactor: Icons
22507fed
allozaur chore: update webui build output
81b8e1ab
allozaur feat: Model management and selection features WIP
2a280b60
allozaur chore: update webui build output
19e5385b
allozaur refactor: Improve server properties management
b1cf8bb8
allozaur refactor: Icons
23a91cd2
allozaur chore: update webui build output
d0d7a88d
allozaur feat: Improve model loading/unloading status updates
284557cd
allozaur chore: update webui build output
9431f358
allozaur refactor: Improve API header management via utility functions
ddf98bdf
ngxson remove support for extra args
e40f35fb
ngxson set hf_repo/docker_repo as model alias when posible
e2731c37
ngxson Merge branch 'master' into xsn/server_model_management_v1_2
becc6026
allozaur refactor: Remove ConversationsService
42483f46
allozaur refactor: Chat requests abort handling
456828b3
allozaur refactor: Server store
d6ee3d13
ngxson tmp webui build
1493ee09
allozaur refactor: Model modality handling
13e79884
allozaur chore: update webui build output
2a5922b1
allozaur refactor: Processing state reactivity
6b95118a
allozaur fix: UI
69065ddc
allozaur refactor: Services/Stores syntax + logic improvements
6a3d6e79
allozaur Merge remote-tracking branch 'ngxson/xsn/server_model_management_v1_2…
78ead498
allozaur refactor: Architecture cleanup
d7335373
allozaur feat: Improve statistic badges
9086bc30
allozaur feat: Condition available models based on modality + better model loa…
db479523
allozaur docs: Architecture documentation
bc577266
ngxson Merge branch 'master' into xsn/server_model_management_v1_2
bdaf44a1
allozaur feat: Update logic for PDF as Image
491fe2d3
ngxson add TODO for http client
7be833da
allozaur refactor: Enhance model info and attachment handling
eed1bd9b
allozaur chore: update webui build output
3470b12b
allozaur refactor: Components naming
5fadd0fe
allozaur chore: update webui build output
04ef4a06
allozaur refactor: Cleanup
1cf5daa8
allozaur refactor: DRY `getAttachmentDisplayItems` function + fix UI
68b653ef
allozaur chore: update webui build output
171a0926
allozaur fix: Modality detection improvement for text-based PDF attachments
dd30810d
allozaur refactor: Cleanup
1adf173d
allozaur docs: Add info comment
2f97dbfa
allozaur refactor: Cleanup
c76de5e0
allozaur re
4d16459b
allozaur refactor: Cleanup
f50ce7b5
allozaur refactor: Cleanup
d49d97c6
allozaur feat: Attachment logic & UI improvements
648d2dee
allozaur refactor: Constants
27b15226
allozaur feat: Improve UI sidebar background color
2464e060
allozaur chore: update webui build output
ce9c9afe
allozaur refactor: Utils imports + move types to `app.d.ts`
493ef087
allozaur test: Fix Storybook mocks
2d556bb9
allozaur chore: update webui build output
a568e74c
allozaur Merge branch 'master' into allozaur/server_model_management_v1_2
33b9cc40
allozaur test: Update Chat Form UI tests
4f39da82
allozaur refactor: Tooltip Provider from core layout
949b5fd6
allozaur refactor: Tests to separate location
ae8a1e81
allozaur Merge remote-tracking branch 'origin/allozaur/server_model_management…
6fd720e7
ngxson Merge branch 'master' into xsn/server_model_management_v1_2
c1dfccd0
ngxson decouple server_models from server_routes
a82dbbfb
allozaur test: Move demo test to tests/server
360a5ed6
allozaur refactor: Remove redundant method
acd3c581
allozaur chore: update webui build output
e8b9d74b
ngxson also route anthropic endpoints
23cb4113
ngxson Merge remote-tracking branch 'webui/allozaur/server_model_management_…
802e77ea
ngxson fix duplicated arg
7b28b5e1
ngxson fix invalid ptr to shutdown_handler
4a1c05c3
ggerganov server : minor
d182544c
ggerganov
ggerganov approved these changes on 2025-12-01
ngxson rm unused fn
f2dbe9c0
ngxson add ?autoload=true|false query param
c3304075
ngxson Merge branch 'master' into xsn/server_model_management_v1_2
05cc22f0
allozaur refactor: Remove redundant code
689ca09b
allozaur Merge remote-tracking branch 'ngxson/xsn/server_model_management_v1_2…
7a95348d
allozaur
ngxson
allozaur docs: Update README documentations + architecture & data flow diagrams
73056fb6
allozaur fix: Disable autoload on calling server props for the model
c49467a3
allozaur chore: update webui build output
9d3b718e
ngxson fix ubuntu build
a6d3f83e
danbev
danbev commented on 2025-12-01
allozaur fix: Model status reactivity
b926cfa3
allozaur fix: Modality detection for MODEL mode
01ed8ced
allozaur chore: update webui build output
b10d9508
allozaur
ngxson
l2k36hk
l2k36hk dismissed these changes on 2025-12-01
allozaur allozaur dismissed their stale review 202 days ago
Not a review from maintainer
allozaur
allozaur approved these changes on 2025-12-01
allozaur
ngxson ngxson merged ec18edfc into master 202 days ago
ngxson
aviallon
allozaur
romaingyh
ServeurpersoCom
strawberrymelonpanda
strawberrymelonpanda
anagri
ServeurpersoCom

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone