llama.cpp
Add flash attention MMA / Tiles to support MiMo-V2.5
#22812
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
5
Changes
View On
GitHub
Add flash attention MMA / Tiles to support MiMo-V2.5
#22812
am17an
merged 5 commits into
ggml-org:master
from
AesSedai:mimo-v2.5-fattn
mimo-v2.5: add flash attention mma/tiles for for d_kq=192 d_v=128
7ed99606
AesSedai
requested a review
11 days ago
github-actions
added
Nvidia GPU
github-actions
added
python
github-actions
added
ggml
JohannesGaessler
commented on 2026-05-07
mimo-v2.5: follow (256, 256) fattn templates
df4abcd4
mimo-v2.5: cleanup comments
a5bf4294
mimo-v2.5: further comment cleanup
0eb8cbe3
JohannesGaessler
commented on 2026-05-07
mimo-v2.5: address PR feedback
67f3e8a4
AesSedai
requested a review
from
ggerganov
11 days ago
github-actions
added
testing
JohannesGaessler
approved these changes on 2026-05-08
am17an
approved these changes on 2026-05-09
am17an
merged
046e2844
into master
10 days ago
Login to write a write a comment.
Login via GitHub
Reviewers
am17an
JohannesGaessler
ggerganov
Assignees
No one assigned
Labels
testing
Nvidia GPU
python
ggml
Milestone
No milestone
Login to write a write a comment.
Login via GitHub