llama.cpp
6ab397e1 - graph : support non-contiguous Q in build_attn_mha (#15908)

Commit
4 days ago
graph : support non-contiguous Q in build_attn_mha (#15908) * support non-contiguous Q in build_attn_mha * Update src/llama-graph.cpp ggml-ci Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Author
Parents
Loading