server : fix default draft model parameters #10586
server : force F16 KV cache for the draft model
150d6e92
server : fix draft params
f3252055
ggerganov
marked this pull request as ready for review 293 days ago
server : various params fixes
11b4d582
ggerganov
changed the title from "server : force F16 KV cache for the draft model" to "server : fix default draft model parameters" 293 days ago
ggerganov
merged commit 70b98fad into master 293 days ago
ggerganov
deleted the gg/server-force-draft-kv-f16 branch 293 days ago