llama.cpp
2f74c354 - graph : make FA compatible with MLA + add initial Metal kernels (#12953)

Commit
244 days ago
graph : make FA compatible with MLA + add initial Metal kernels (#12953) * graph : make mla compatible with FA * metal : add exp FA kernels for DeepSeek models ggml-ci * llama : minor naming updates ggml-ci * ggml : disable FA for DS head sizes * tests : add FA tests for MLA shapes ggml-ci
Author
Parents
Loading