llama.cpp
server: support multiple generations from one prompt (OAI "n" option)
#17775
Merged

server: support multiple generations from one prompt (OAI "n" option) #17775

ngxson merged 11 commits into ggml-org:master from ngxson:xsn/add_n_support
ngxson
ngxson backend support
15ce574d
ngxson server: support multiple generations from one prompt (OAI "n" option)
0d842cb5
github-actions github-actions added examples
github-actions github-actions added server
ngxson fix invalid batch
bf33d13b
ngxson format oai
a768a5e8
ngxson clean up
5cc3156f
ngxson disable ctx shift
2a7728f5
ngxson add test
e0660710
ngxson update comments
46f6fd26
ngxson ngxson marked this pull request as ready for review 189 days ago
ngxson ngxson requested a review from ggerganov ggerganov 189 days ago
ngxson
allozaur
ngxson
github-actions github-actions added python
ngxson fix style
b65ee647
ITankForCAD
ngxson
ngxson add n_cmpl to docs [no ci]
6fb3226d
ggerganov
ggerganov approved these changes on 2025-12-06
ggerganov
ngxson allowing using both n_cmpl and n
ea7f0669
ngxson
ngxson ngxson merged c42712b0 into master 188 days ago
jacekpoplawski
ServeurpersoCom
ngxson
ServeurpersoCom
ggerganov
ngxson
ngxson commented on 2026-01-07
ServeurpersoCom

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone