server : fix speculative decoding with context shift #10641
server : fix speculative decoding with context shift
a5a915b5
server : take into account speculative limits
b436edaa
ggerganov
force pushed
to
b436edaa
1 year ago
server : add tests
81611bef
ngxson
approved these changes
on 2024-12-04
ggerganov
merged
1da7b765
into master 1 year ago
Assignees
No one assigned
Labels
examples
python
server
Login to write a write a comment.
Login via GitHub