llama.cpp
server: add margin for draft model for `fit`
#23485
Merged

Commits
  • server: add margin for draft model for `fit`
    am17an committed 36 days ago
  • clarify worst case memory usage for MTP context
    am17an committed 36 days ago
  • use params.devices if not empty
    am17an committed 35 days ago