onnxruntime
eb8f6c7c - Transpose optimizer enhancements (#15117)

Commit

2 years ago

Transpose optimizer enhancements (#15117)

### Description

- Add debug infrastructure to dump out model at various stages of transpose optimization.
- Handle more scenarios where Transpose -> Reshape can be merged.
- Run L1 optimizers after layout transform to constant fold initializers that had their layout changed.
- Use cost check for Concat post layout transform as pushing a Transpose through it can potentially add Transpose nodes to multiple other inputs.
- Update internal testing EP to support test where you want it to take all nodes, use NHWC layout, and to use dummy static kernels instead of compiling so the ops in the graph post-initialization can be counted.
- Misc cleanup in InferenceSession to not unnecessarily pass args to TransposeGraph for class members.

### Motivation and Context

Address perf issue seen with model where a Transpose gets blocked by a Reshape that could have been treated as a Transpose.

---------

Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>
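To illustrate the motivating case, a Reshape that only moves size-1 dimensions around (and keeps the rank and the order of the other dimensions) leaves the element order unchanged, so it can be expressed as a Transpose. The sketch below shows that general idea only; it is not the code from this commit, the function name `reshape_as_transpose_perm` is hypothetical, and it assumes both shapes are fully known, static, positive integers.

```python
from typing import List, Optional, Sequence


def reshape_as_transpose_perm(
    in_shape: Sequence[int], out_shape: Sequence[int]
) -> Optional[List[int]]:
    """Return a perm that reproduces the reshape as a transpose, or None.

    Hypothetical helper for illustration; assumes static, concrete shapes.
    """
    # A Transpose never changes rank, so the ranks must match.
    if len(in_shape) != len(out_shape):
        return None

    # The dims that are not 1 must appear in the same order on both sides,
    # otherwise the reshape changes how elements are grouped.
    in_non1_axes = [axis for axis, d in enumerate(in_shape) if d != 1]
    in_non1_dims = [in_shape[axis] for axis in in_non1_axes]
    out_non1_dims = [d for d in out_shape if d != 1]
    if in_non1_dims != out_non1_dims:
        return None

    # Build perm so that out_shape[i] == in_shape[perm[i]].
    non1_axes = iter(in_non1_axes)
    one_axes = iter(axis for axis, d in enumerate(in_shape) if d == 1)
    return [next(one_axes) if d == 1 else next(non1_axes) for d in out_shape]
```

For example, reshaping `[1, 3, 1, 4]` to `[3, 1, 4, 1]` yields the perm `[1, 0, 3, 2]`, while reshaping `[2, 3]` to `[3, 2]` yields `None` because that reshape regroups elements rather than permuting axes.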

References

#15117 - Transpose optimizer enhancements

Author

skottmckay

Parents

792d4111
