onnxruntime
Fix CUDA Attention dispatch: skip MEA when head_size != v_head_size in GQA
#28358
Merged

Fix CUDA Attention dispatch: skip MEA when head_size != v_head_size in GQA #28358

justinchuby
justinchuby Fix CUDA Attention dispatch: skip MEA when head_size != v_head_size i…
32b357d2
justinchuby justinchuby requested a review from copilot-pull-request-reviewer copilot-pull-request-reviewer 6 days ago
titaiwangms titaiwangms requested a review from titaiwangms titaiwangms 6 days ago
tianleiwu
tianleiwu approved these changes on 2026-05-05
justinchuby justinchuby merged 1f257837 into main 6 days ago
justinchuby justinchuby deleted the fix-attention-head-size-mismatch branch 6 days ago
titaiwangms

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone