llama.cpp
server: avoid unnecessary checkpoint invalidation for recurrent / hybrid models
#24035
Open

server: avoid unnecessary checkpoint invalidation for recurrent / hybrid models #24035

Regrad
Regrad server: improve checkpoint reuse heuristics for recurrent/hybrid models
fc6a7e0d
Regrad Regrad requested a review 2 days ago
Regrad Regrad requested a review 2 days ago
github-actions github-actions added examples
github-actions github-actions added server
Regrad Regrad closed this 2 days ago
Regrad Regrad reopened this 2 days ago
pwilkin
ggerganov
Regrad
Regrad
nssatlantis
Regrad Regrad marked this pull request as draft 22 hours ago

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone