onnxruntime
65e694ea - [QNN-EP] Apply Softmax layout transformation for GPU (#26233)

Commit
119 days ago
[QNN-EP] Apply Softmax layout transformation for GPU (#26233) ### Description - Transposes are inserted for Softmax with axis != output_rank-1 for the HTP backend. - The GPU backend also has this requirement on the axis param, so this change enables the layout transformation for the GPU as well. ### Motivation and Context - Enables more models with GPU backend.
Author
Parents
Loading