llama.cpp
server : fix speculative decoding with context shift
#10641
Merged

server : fix speculative decoding with context shift #10641

ggerganov merged 3 commits into master from gg/server-fix-spec-ctx-shift
ggerganov
ggerganov server : fix speculative decoding with context shift
a5a915b5
github-actions github-actions added examples
github-actions github-actions added server
ngxson
ggerganov
ggerganov server : take into account speculative limits
b436edaa
ggerganov ggerganov force pushed to b436edaa 1 year ago
ggerganov server : add tests
81611bef
github-actions github-actions added python
ggerganov ggerganov requested a review from ngxson ngxson 1 year ago
ngxson
ngxson approved these changes on 2024-12-04
unclemusclez
josharian
ggerganov ggerganov merged 1da7b765 into master 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone