onnxruntime
7be006c4 - [js/webgpu] Optimize convtranspose (#23302)

Commit
1 year ago
[js/webgpu] Optimize convtranspose (#23302) ### Description <!-- Describe your changes. --> BUG #23273 With this change, I see the convTranspose time in that bug becomes ~7s from ~90s on my Meteor Lake. This PR does below things: 1. Use stride to update the increasement in the loop. In the bug, the stride is 1024, which can greatly reduce the loop times. 2. Support components for A to reduce the memory access times. 3. When output channels is 1, the b components can be same with A to further reduce the memory access times.
Author
Parents
Loading