server: args for draft model cache types (#11200) #13782
Adds server parameters for draft model cache type. Fixes ggml-org/lla…
0522270d
ggerganov
approved these changes
on 2025-05-30
ggerganov
merged
d67341dc
into master 288 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub