llama.cpp
9c7185dd - CUDA: enable FA for FP32 KV cache (#16546)

Commit
21 days ago
CUDA: enable FA for FP32 KV cache (#16546)
Parents
Loading