[Attention] MLA move rotary embedding to cuda-graph region #17668
move rot emb
f5196b5b
fix v1 torch compile
1f7ba655
v0 fix
2eb232a8
LucasWilkinson
changed the title [WIP][Attention] MLA move rotary embedding to cuda-graph region [Attention] MLA move rotary embedding to cuda-graph region 232 days ago
LucasWilkinson
marked this pull request as ready for review 232 days ago
fix pre-commit
0b7167ba
mgoin
approved these changes
on 2025-05-08
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub