llama.cpp
Server: when no slot is available, defer the task instead of returning "slot unavailable"
#5018
Merged

Commits
  • server: defer task when no slot is available
    Xuan Son Nguyen committed 2 years ago
  • remove unnecessary log
    Xuan Son Nguyen committed 2 years ago
Loading