llama.cpp
9c7185dd
- CUDA: enable FA for FP32 KV cache (#16546)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
21 days ago
CUDA: enable FA for FP32 KV cache (#16546)
References
#16546 - CUDA: enable FA for FP32 KV cache
Author
JohannesGaessler
Parents
1ee9d0b4
Loading