llama.cpp
tests : add non-cont K,V FA tests
#14756
Merged

Commits
  • tests : add non-cont K,V FA tests
    ggerganov committed 182 days ago
  • CUDA: fix quantized KV cache + multiple sequences (#14822)
    JohannesGaessler committed 177 days ago
Loading