llama.cpp
server: args for draft model cache types (#11200)
#13782
Merged

server: args for draft model cache types (#11200) #13782

aa956
aa956 Adds server parameters for draft model cache type. Fixes ggml-org/lla…
0522270d
aa956 aa956 requested a review from ngxson ngxson 313 days ago
github-actions github-actions added examples
github-actions github-actions added server
ggerganov
ggerganov approved these changes on 2025-05-30
CISC
aa956
CISC
ggerganov ggerganov merged d67341dc into master 288 days ago
ggerganov

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone