[Native WebGPU] Support shared memory version of ReduceOps #24399
Added shared memory version of ReduceOps
076cc6cb
Use the output from the context instead of a local tensor
cdf35e4b
Use naive version conditionally.
6e313e26
satyajandhyala
marked this pull request as ready for review 256 days ago
Current shared implementation does not support ArgMin/ArgMax
8728cfa8
Added workgroup_size to hint.
61407903
guschmue
dismissed these changes
on 2025-04-14
Added SetWorkgroupSize call.
2351268d
fs-eire
dismissed these changes
on 2025-04-14
Fix coner case when the input is empty.
5442d44b
Simplified ReduceMean
0ed04a84
Removed GetOpSpecificCode function. Using table lookup instead.
6c7bf340
ArgMax/ArgMin constructor does not know the attributes before the bas…
76a9fe16
guschmue
approved these changes
on 2025-04-15
satyajandhyala
deleted the sajandhy/webgpu-ep-shader-reduction-op branch 252 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub