llama.cpp
Add flash attention MMA / Tiles to support MiMo-V2.5
#22812
Merged

Add flash attention MMA / Tiles to support MiMo-V2.5 #22812

am17an merged 5 commits into ggml-org:master from AesSedai:mimo-v2.5-fattn
AesSedai
AesSedai mimo-v2.5: add flash attention mma/tiles for for d_kq=192 d_v=128
7ed99606
AesSedai AesSedai requested a review 11 days ago
github-actions github-actions added Nvidia GPU
github-actions github-actions added python
github-actions github-actions added ggml
coder543
coder543
JohannesGaessler
JohannesGaessler commented on 2026-05-07
AesSedai
AesSedai mimo-v2.5: follow (256, 256) fattn templates
df4abcd4
AesSedai mimo-v2.5: cleanup comments
a5bf4294
AesSedai mimo-v2.5: further comment cleanup
0eb8cbe3
AesSedai
JohannesGaessler
JohannesGaessler commented on 2026-05-07
coder543
AesSedai mimo-v2.5: address PR feedback
67f3e8a4
AesSedai AesSedai requested a review from ggerganov ggerganov 11 days ago
github-actions github-actions added testing
AesSedai
JohannesGaessler
JohannesGaessler approved these changes on 2026-05-08
AesSedai
JohannesGaessler
am17an
am17an approved these changes on 2026-05-09
am17an am17an merged 046e2844 into master 10 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone