[webgpu] Optimize FlashAttention for prefill #25395
[webgpu] Optimize FlashAttention for prefill
3063b2e2
qjia7
commented
on 2025-07-15
qjia7
commented
on 2025-07-15
qjia7
dismissed these changes
on 2025-07-15
Explicitly set `is_unidirectional_` to true for GQA
7bd19af3
daijh
dismissed their stale review
via 7bd19af3
191 days ago
fs-eire
approved these changes
on 2025-07-22
guschmue
merged
2bd00ec4
into main 177 days ago
daijh
deleted the optimize-flash-attention-for-prefill branch 177 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub