llama.cpp
graph : make FA compatible with MLA + add initial Metal kernels
#12953
Merged

graph : make FA compatible with MLA + add initial Metal kernels #12953

ggerganov merged 5 commits into master from gg/mla
ggerganov
github-actions github-actions added testing
github-actions github-actions added Nvidia GPU
github-actions github-actions added Vulkan
github-actions github-actions added ggml
github-actions github-actions added Apple Metal
jukofyork
ggerganov graph : make mla compatible with FA
e3308567
ggerganov metal : add exp FA kernels for DeepSeek models
9b64dccf
ggerganov llama : minor naming updates
9cc85ddb
ggerganov ggml : disable FA for DS head sizes
43c762bb
ggerganov tests : add FA tests for MLA shapes
facdf870
ggerganov ggerganov force pushed from 2cbc16d1 to facdf870 350 days ago
ggerganov ggerganov merged 2f74c354 into master 350 days ago
ggerganov ggerganov deleted the gg/mla branch 350 days ago
Panchovix

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone