llama.cpp
a04c2b06 - server: improve slots scheduling for n_cmpl (#18789)

Commit
3 days ago
server: improve slots scheduling for n_cmpl (#18789) * server : make sure children tasks are scheduled to launch with parent * fix * add comment pointing to this PR * fix * clean up * more debug messages * add pop_deferred_task with specific ID version * improve the logic * simple approach * no double move * correct return type of launch_slots_with_parent_task
Author
Parents
Loading