llama.cpp
39173bca - context : reserve new scheduler when graph topology changes (#18547)

Commit
2 days ago
context : reserve new scheduler when graph topology changes (#18547)

* context : reserve new scheduler when graph topology changes
* cont : fix
* cont : fix reserve
* cont : reserve only when changes occur + timing
* context : add comments
* llama : reserve on sampler changes
* common : allow null common_sampler
* server : task declares needs (embd, logits, sampling)
* server : do not init sampler if not needed
* llama : fix need_reserve when unsetting a sampler
* server : consolidate slot reset/clear logic