onnxruntime
9799c3fb - [webgpu] Enable FlashAttention for GQA (#23761)

Commit
1 year ago
[webgpu] Enable FlashAttention for GQA (#23761)