[Bugfix][Kernel] FA3 Fix - RuntimeError: This flash attention build only supports pack_gqa (for build size reasons). #12405
LucasWilkinson
changed the title [Kernel] FA3 Fix - RuntimeError: This flash attention build only supports pack_gqa (for build size reasons). [Bugfix][Kernel] FA3 Fix - RuntimeError: This flash attention build only supports pack_gqa (for build size reasons). 330 days ago
renable packed GQA
f992fef6
mgoin
approved these changes
on 2025-01-24
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub