onnxruntime
[Native WebGPU] Support shared memory version of ReduceOps
#24399
Merged

[Native WebGPU] Support shared memory version of ReduceOps #24399

satyajandhyala
satyajandhyala Added shared memory version of ReduceOps
076cc6cb
satyajandhyala Use the output from the context instead of a local tensor
cdf35e4b
github-actions
github-actions commented on 2025-04-11
satyajandhyala Use naive version conditionally.
6e313e26
satyajandhyala satyajandhyala marked this pull request as ready for review 256 days ago
satyajandhyala satyajandhyala added ep:WebGPU
satyajandhyala Current shared implementation does not support ArgMin/ArgMax
8728cfa8
satyajandhyala Added workgroup_size to hint.
61407903
guschmue
guschmue dismissed these changes on 2025-04-14
guschmue
satyajandhyala Added SetWorkgroupSize call.
2351268d
satyajandhyala satyajandhyala dismissed their stale review via 2351268d 253 days ago
fs-eire
fs-eire dismissed these changes on 2025-04-14
satyajandhyala Fix coner case when the input is empty.
5442d44b
satyajandhyala satyajandhyala dismissed their stale review via 5442d44b 253 days ago
satyajandhyala
satyajandhyala Simplified ReduceMean
0ed04a84
satyajandhyala Removed GetOpSpecificCode function. Using table lookup instead.
6c7bf340
satyajandhyala ArgMax/ArgMin constructor does not know the attributes before the bas…
76a9fe16
satyajandhyala satyajandhyala requested a review from fs-eire fs-eire 253 days ago
satyajandhyala satyajandhyala requested a review from guschmue guschmue 253 days ago
guschmue
guschmue approved these changes on 2025-04-15
guschmue
satyajandhyala satyajandhyala merged ff607b48 into main 252 days ago
satyajandhyala satyajandhyala deleted the sajandhy/webgpu-ep-shader-reduction-op branch 252 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone