sampling : support multiple outputs per sequence #19833
danbev
marked this pull request as draft 8 days ago
sampling : support multiple outputs per sequence
1138d5c2
llama : add n_sampling_outputs_max cparam
1e8c02aa
llama : enable static graph for multiple sampling outputs per sequence
765998f2
server : enable backend sampling for multiple outputs per sequence
2235b4be
sampling : add clamping to backend dist sampler
5c92c76e
danbev
force pushed
from
6c6b36bc
to
5c92c76e
5 days ago
danbev
marked this pull request as ready for review 5 days ago
Assignees
No one assigned
Labels
testing
examples
server
Login to write a write a comment.
Login via GitHub