PR #5684 server: add KV cache quantization options

server: add KV cache quantization options #5684

ggerganov merged 1 commit into ggml-org:master from AlpinDale:server/kv-cache

server: add KV cache quantization

9e73cc17

AlpinDale changed the title ~~server: add KV cache quantization option~~ server: add KV cache quantization options 2 years ago

ggerganov approved these changes on 2024-02-23

ggerganov merged fd43d66f into master 2 years ago

AlpinDale deleted the server/kv-cache branch 2 years ago

Reviewers

ggerganov

Assignees

No one assigned

Labels

None yet

Milestone

No milestone