llama.cpp
server : handle context overflow during decode
#17267
Merged

server : handle context overflow during decode #17267

ggerganov
github-actions github-actions added examples
github-actions github-actions added server
Base automatically changed from gg/server-fix-can-batch-with to master 33 days ago
ggerganov server : handle context overflow during decode
b9511616
ggerganov server : minor refactor
741baaf6
ggerganov ggerganov force pushed to 741baaf6 33 days ago
ggerganov ggerganov marked this pull request as ready for review 33 days ago
ggerganov ggerganov requested a review from ngxson ngxson 33 days ago
ngxson
ngxson approved these changes on 2025-11-14
ggerganov ggerganov merged 5b2093be into master 31 days ago
ggerganov ggerganov deleted the gg/server-fix-decode-error-handling branch 31 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone