server: add KV cache quantization options #5684
server: add KV cache quantization
9e73cc17
AlpinDale
changed the title from "server: add KV cache quantization option" to "server: add KV cache quantization options" 2 years ago
ggerganov
approved these changes
on 2024-02-23
ggerganov
merged
fd43d66f
into master 2 years ago
AlpinDale
deleted the server/kv-cache branch 2 years ago