llama.cpp
13b339bc - server : do not default to multiple slots with speculative decoding (#17017)

Commit
134 days ago
server : do not default to multiple slots with speculative decoding (#17017) * server : do not default to multiple slots with speculative decoding * cont : fix
Author
Parents
Loading