llama.cpp
a4854f03 - cont : improve n_cmpl logic

Commit
14 days ago
cont : improve n_cmpl logic

- launch the parent task first so it finds the slot with best cache
- parent task waits for child tasks to be launched
- when a child task finishes, remove its cache
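
The ordering described in the commit message can be illustrated with a small, self-contained sketch. This is not the code from the commit: `Slot`, `Launch`, `pick_best_slot`, `launch_completions`, and `finish_child` are hypothetical names invented for illustration, and the sketch assumes that `n_cmpl` is the number of completions requested for one prompt and that "best cache" means the longest cached prefix shared with the prompt.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical types for illustration only; not llama.cpp's actual server structs.
struct Slot {
    std::vector<int> cache;  // token ids currently cached in this slot
    bool busy = false;
};

struct Launch {
    int parent_slot = -1;          // slot chosen for the parent task
    std::vector<int> child_slots;  // slots chosen for the child tasks
    bool parent_ready = false;     // true once every child has been launched
};

// Length of the common prefix between a slot's cache and the prompt.
static size_t common_prefix(const std::vector<int> & a, const std::vector<int> & b) {
    size_t n = 0;
    while (n < a.size() && n < b.size() && a[n] == b[n]) {
        n++;
    }
    return n;
}

// Pick the free slot whose cache shares the longest prefix with the prompt.
// Returns -1 when no slot is free.
static int pick_best_slot(const std::vector<Slot> & slots, const std::vector<int> & prompt) {
    int best = -1;
    size_t best_len = 0;
    for (size_t i = 0; i < slots.size(); i++) {
        if (slots[i].busy) {
            continue;
        }
        const size_t len = common_prefix(slots[i].cache, prompt);
        if (best == -1 || len > best_len) {
            best = (int) i;
            best_len = len;
        }
    }
    return best;
}

// Launch one parent and (n_cmpl - 1) children for the same prompt.
static Launch launch_completions(std::vector<Slot> & slots, const std::vector<int> & prompt, int n_cmpl) {
    assert(n_cmpl >= 1);
    Launch res;

    // 1. The parent goes first, so it gets the slot with the best cache.
    res.parent_slot = pick_best_slot(slots, prompt);
    if (res.parent_slot < 0) {
        return res;
    }
    slots[res.parent_slot].busy  = true;
    slots[res.parent_slot].cache = prompt;

    // 2. Children take whatever free slots remain.
    for (int i = 1; i < n_cmpl; i++) {
        const int id = pick_best_slot(slots, prompt);
        if (id < 0) {
            break;
        }
        slots[id].busy  = true;
        slots[id].cache = prompt;
        res.child_slots.push_back(id);
    }

    // 3. The parent may proceed only after all children have been launched.
    res.parent_ready = (int) res.child_slots.size() == n_cmpl - 1;
    return res;
}

// When a child finishes, drop its cache and free the slot.
static void finish_child(std::vector<Slot> & slots, int id) {
    slots[id].cache.clear();
    slots[id].busy = false;
}
```

In this sketch, launching the parent before the children guarantees that a child cannot occupy the slot with the longest matching prefix, and clearing a finished child's cache leaves only the parent's slot holding the prompt for later reuse.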