llama.cpp
5ca49cbe - ggml: implement quantized KV cache for FA (#7372)

Commit
2 years ago
ggml: implement quantized KV cache for FA (#7372)
Parents
Loading