llama.cpp
1da7b765 - server : fix speculative decoding with context shift (#10641)
Commit
310 days ago
server : fix speculative decoding with context shift (#10641)

* server : fix speculative decoding with context shift

ggml-ci

* server : take into account speculative limits

ggml-ci

* server : add tests
References
#10641 - server : fix speculative decoding with context shift
Author
ggerganov
Parents
59f4db10