llama.cpp
server: add KV cache quantization options
#5684
Merged

server: add KV cache quantization options #5684

AlpinDale
AlpinDale server: add KV cache quantization
9e73cc17
AlpinDale AlpinDale changed the title server: add KV cache quantization option server: add KV cache quantization options 2 years ago
ggerganov
ggerganov approved these changes on 2024-02-23
ggerganov ggerganov merged fd43d66f into master 2 years ago
AlpinDale AlpinDale deleted the server/kv-cache branch 2 years ago
K-Mistele
slaren

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone