onnxruntime
e8bf46a7 - [WebGPU EP] Support GroupQueryAttention (#22658)

Commit
1 year ago
[WebGPU EP] Support GroupQueryAttention (#22658) ### Description <!-- Describe your changes. --> Support GroupQueryAttention operator for native webgpu ep. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> This is required for inferencing some LLMs.
Parents
Loading