llama.cpp
graph : make FA compatible with MLA + add initial Metal kernels
#12953
Merged

graph : make FA compatible with MLA + add initial Metal kernels #12953

ggerganov merged 5 commits into master from gg/mla
ggerganov
github-actions github-actions added testing
github-actions github-actions added Nvidia GPU
github-actions github-actions added Vulkan
github-actions github-actions added ggml
github-actions github-actions added Apple Metal
jukofyork
ggerganov graph : make mla compatible with FA
e3308567
ggerganov metal : add exp FA kernels for DeepSeek models
9b64dccf
ggerganov llama : minor naming updates
9cc85ddb
ggerganov ggml : disable FA for DS head sizes
43c762bb
ggerganov tests : add FA tests for MLA shapes
facdf870
ggerganov ggerganov force pushed from 2cbc16d1 to facdf870 1 year ago
ggerganov ggerganov merged 2f74c354 into master 1 year ago
ggerganov ggerganov deleted the gg/mla branch 1 year ago
Panchovix

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone