llama.cpp
sampling : support multiple outputs per sequence
#19833
Open

sampling : support multiple outputs per sequence #19833

danbev
danbev danbev requested a review from ggerganov ggerganov 8 days ago
danbev danbev requested a review from CISC CISC 8 days ago
danbev danbev marked this pull request as draft 8 days ago
github-actions github-actions added testing
github-actions github-actions added examples
github-actions github-actions added server
danbev sampling : support multiple outputs per sequence
1138d5c2
danbev llama : add n_sampling_outputs_max cparam
1e8c02aa
danbev llama : enable static graph for multiple sampling outputs per sequence
765998f2
danbev server : enable backend sampling for multiple outputs per sequence
2235b4be
danbev sampling : add clamping to backend dist sampler
5c92c76e
danbev danbev force pushed from 6c6b36bc to 5c92c76e 5 days ago
danbev danbev marked this pull request as ready for review 5 days ago
danbev danbev requested a review from ngxson ngxson 5 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone