onnxruntime
b9702587 - [oneDNN] Implemented Concat Op (#13646)

Commit
3 years ago
[oneDNN] Implemented Concat Op (#13646) ### Description This PR implements the **Concat Operator** for the **OneDNN Execution Provider**. ### Motivation and Context - As part of evaluating ORT performance on ARM based targets such as Graviton3, we discovered that the OneDNN EP had some gaps on operator coverage. - The Concat Operator is fairly common and used in models such as Yolov5, MobileNet, DistillBert and GPT2 - For Yolov5 specifically, this improves average inference time over 100 runs on Graviton3 from 180.2ms to 115.5ms when using OneDNN + ARM Compute Library. Co-authored-by: Gaz Iqbal <giqbal@octoml.ai>
Author
Parents
Loading