llama.cpp
75422e8b - graph : normalize Q, K, V shapes + sync cross attention (#12449)

Commit
271 days ago
graph : normalize Q, K, V shapes + sync cross attention (#12449) * graph : normalize Q, K, V shapes and add comments ggml-ci * context : synchronize before getting cross attention data * model : fix command-r attention norm check
Author
Parents
Loading