llama.cpp
server : do not default to multiple slots with speculative decoding
#17017
Merged

server : do not default to multiple slots with speculative decoding #17017

ggerganov merged 2 commits into master from server/fix-draft-slots
ggerganov
ggerganov server : do not default to multiple slots with speculative decoding
caa4ca7a
ggerganov ggerganov requested a review from ngxson ngxson 224 days ago
github-actions github-actions added examples
github-actions github-actions added server
pockers21
ggerganov cont : fix
3ce702f6
ggerganov ggerganov merged 13b339bc into master 224 days ago
ggerganov ggerganov deleted the server/fix-draft-slots branch 224 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone