server : add `n_discard` parameter to specify the number of tokens to discard when context is shifted #6300
server : add `n_discard` parameter to specify the number of tokens to…
082611a2
ggerganov approved these changes on 2024-03-26
ggerganov merged 3d032ece into master 1 year ago