[WebGPU/JSEP] Support group query attention do_rotary attribute #23524
Added do_rotary attribute support to GQA.
361df45c
Apply rotary embedding before transposing to to BNSH
cc317ca4
Merge branch 'main' of https://github.com/microsoft/onnxruntime into …
c95b8468
minor changes.
7952ddb9
Merge branch 'main' of https://github.com/microsoft/onnxruntime into …
ba0d6399
Merge branch 'main' of https://github.com/microsoft/onnxruntime into …
3241118e
A fixed the pos_id type.
a95c3d85
minor change
845469b7
Fixed hint for generate positionIDs.
d829e71d
Merge branch 'main' of https://github.com/microsoft/onnxruntime into …
4edf220d
minor bug
49b765e2
Merge branch 'main' of https://github.com/microsoft/onnxruntime into …
2bc125aa
satyajandhyala
marked this pull request as ready for review 1 year ago
Merge branch 'main' of https://github.com/microsoft/onnxruntime into …
85ce62a6
Fixed GeneratePositionIds code.
ee39ae0e
satyajandhyala
marked this pull request as ready for review 1 year ago
lint
c2a83360
satyajandhyala
changed the title [WIP][WebGPU/JSEP] Support group query attention do_rotary attribute [WebGPU/JSEP] Support group query attention do_rotary attribute 1 year ago
guschmue
approved these changes
on 2025-03-07
satyajandhyala
deleted the sajandhy/webgpu_group_query_attention_do_rotary branch 1 year ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub