server : do not default to multiple slots with speculative decoding #17017
server : do not default to multiple slots with speculative decoding
caa4ca7a
cont : fix
3ce702f6
ggerganov
merged
13b339bc
into master 224 days ago
ggerganov
deleted the server/fix-draft-slots branch 224 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub