llama.cpp
0a019ed8 - server: add --models-memory-max parameter to allow dynamically unloading models when they exceed a memory size threshold

Commit
15 days ago
server: add --models-memory-max parameter to allow dynamically unloading models when they exceed a memory size threshold
Author
Committer
Parents
Loading