llama.cpp
speculative : fix n_outputs_max and remove draft-simple auto-enable
#23988
Merged


ggerganov merged 5 commits into master from gg/spec-fix-n-max
ggerganov
ggerganov speculative : add common_speculative_n_max helper function
8c41b75a
ggerganov cont : draft context always has n_parallel outputs
a808e890
ggerganov llama : log n_outputs_max
016191d6
ggerganov speculative : remove draft-simple auto-enable
6476b674
ggerganov added the refactoring label
github-actions added the examples label
github-actions added the server label
ggerganov ci : enable server tests on PRs
2f6f998d
github-actions added the devops label
ggerganov marked this pull request as ready for review 4 days ago
ggerganov requested a review 4 days ago
ggerganov requested a review 4 days ago
ggerganov requested a review 4 days ago
ggerganov merged 5dcb7116 into master 4 days ago
ggerganov deleted the gg/spec-fix-n-max branch 4 days ago
CISC
ServeurpersoCom
pwilkin
ServeurpersoCom
ggerganov

Reviewers
No reviews
Assignees
No one assigned