llama.cpp
PoC server with fully functional router, model load/unload (multiple models in parallel)
#37
Open

Loading