llama.cpp
server : add `n_discard` parameter to specify the number of tokens to discard when context is shifted
#6300
Merged

kaetemi
kaetemi server : add `n_discard` parameter to specify the number of tokens to…
082611a2
ngxson
kaetemi
ggerganov
ggerganov approved these changes on 2024-03-26
ggerganov ggerganov merged 3d032ece into master 1 year ago
