onnxruntime
e01a8519 - [DML EP] Re-architect | Partitioning as Transformer (#13131)

Commit
3 years ago
[DML EP] Re-architect | Partitioning as Transformer (#13131) ### Description Re-architect DML EP to allow ORT L2/L3 transformers. This change includes: - During ORT graph partitioning, DML EP will only set the dmlExecutionProvider to all eligible nodes. - Moved DML specific operator transformer as L2 transformer - Introduced a new DMLGraphFusionTransformer, applicable only for DML EP, which is responsible to - partition the graph - fuse each partition into a IDMLCompiledOperator - register the kernel for each partition ### Motivation and Context - Why is this change required? What problem does it solve? It enables ORT L2/L3 transformers for DML EP, which will increase the perf of Transformer-based models. - If it fixes an open issue, please link to the issue here. N/A Co-authored-by: Sumit Agarwal <sumitagarwal@microsoft.com>
Author
Parents
Loading