llama.cpp
sampling : refactor + optimize penalties sampler
#10803
Merged

sampling : refactor + optimize penalties sampler #10803

ggerganov merged 10 commits into master from gg/sampling-penalties
ggerganov
github-actions github-actions added testing
github-actions github-actions added examples
github-actions github-actions added devops
github-actions github-actions added server
slaren
ggerganov ggerganov force pushed 1 year ago
ggerganov
slaren
ggerganov ggerganov requested a review from ngxson ngxson 1 year ago
ngxson
ngxson approved these changes on 2024-12-12
ggerganov
slaren
slaren commented on 2024-12-12
p-e-w
ggerganov
MaggotHATE
ggerganov
ggerganov
MaggotHATE
ggerganov
ngxson
ggerganov ggerganov force pushed to a3125686 1 year ago
ggerganov
ggerganov commented on 2024-12-13
ggerganov
ggerganov ggerganov requested a review from slaren slaren 1 year ago
p-e-w
MaggotHATE
ggerganov
ggerganov
ggerganov commented on 2024-12-14
ggerganov sampling : refactor + optimize penalties sampler
0a1f7fb6
ggerganov common : apply ignore_eos as logit bias
58a5c3bb
ggerganov batched : remove penalties sampler
a04a5b52
ggerganov params : allow penalty_last_n == -1 to be equal to context size
9847a375
ggerganov common : by default, move the penalties at the end of the sampling chain
97261aa2
ggerganov common : ignore all EOG tokens
1ff92962
ggerganov common : move back the penalties at the front of the sampling chain
685c84c3
ggerganov readme : restore hint about --ignore-eos flag [no ci]
60d26ded
ggerganov llama : minor
e27c7119
ggerganov webui : update
b58ebf30
ggerganov ggerganov force pushed from 7415f3fd to b58ebf30 1 year ago
ggerganov ggerganov merged 644fd71b into master 1 year ago
ggerganov ggerganov deleted the gg/sampling-penalties branch 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone