vllm
[Bugfix][Kernel] FA3 Fix - RuntimeError: This flash attention build only supports pack_gqa (for build size reasons).
#12405
Merged

[Bugfix][Kernel] FA3 Fix - RuntimeError: This flash attention build only supports pack_gqa (for build size reasons). #12405

LucasWilkinson
LucasWilkinson LucasWilkinson requested a review from tlrmchlsmth tlrmchlsmth 330 days ago
LucasWilkinson LucasWilkinson requested a review from WoosukKwon WoosukKwon 330 days ago
LucasWilkinson LucasWilkinson requested a review from robertgshaw2-redhat robertgshaw2-redhat 330 days ago
LucasWilkinson LucasWilkinson requested a review from njhill njhill 330 days ago
LucasWilkinson LucasWilkinson requested a review from ywang96 ywang96 330 days ago
LucasWilkinson LucasWilkinson requested a review from comaniac comaniac 330 days ago
LucasWilkinson LucasWilkinson requested a review from alexm-redhat alexm-redhat 330 days ago
github-actions
mergify mergify added ci/build
LucasWilkinson LucasWilkinson changed the title [Kernel] FA3 Fix - RuntimeError: This flash attention build only supports pack_gqa (for build size reasons). [Bugfix][Kernel] FA3 Fix - RuntimeError: This flash attention build only supports pack_gqa (for build size reasons). 330 days ago
tlrmchlsmth
tlrmchlsmth approved these changes on 2025-01-24
mergify
mergify mergify added needs-rebase
LucasWilkinson renable packed GQA
f992fef6
LucasWilkinson LucasWilkinson force pushed to f992fef6 330 days ago
mergify mergify removed needs-rebase
tlrmchlsmth tlrmchlsmth added ready
tlrmchlsmth tlrmchlsmth enabled auto-merge (squash) 330 days ago
mgoin
mgoin approved these changes on 2025-01-24
tlrmchlsmth tlrmchlsmth merged 3132a933 into main 330 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone