llama.cpp
server: context shift
#24210
Open

server: context shift #24210

C-Prime90 wants to merge 3 commits into ggml-org:master from C-Prime90:PR-ContextShift
C-Prime90
C-Prime90 server : trigger context shift for any processing slot
51082bd4
C-Prime90 server : restore prompt truncation when context shift is enabled
6710c352
C-Prime90 C-Prime90 requested a review 15 days ago
github-actions github-actions added examples
github-actions github-actions added server
C-Prime90 C-Prime90 changed the title Pr context shift server: context shift 15 days ago
C-Prime90 server : remove can_split guard from prompt truncation
071ffc9d

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone