onnxruntime
ae501eeb - [Native WebGPU EP] Add packedQKV and do_rotary attribute support to GroupQueryAttention operator (#23386)

Commit
335 days ago
[Native WebGPU EP] Add packedQKV and do_rotary attribute support to GroupQueryAttention operator (#23386) ### Description Add Packed QKV inputs and do_rotary attribute to GQA. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> Packed QKV inputs and do_rotary attribute are required for certain models.
Parents
Loading