[oneDNN] Implemented Concat Op (#13646)

Commit

3 years ago

[oneDNN] Implemented Concat Op (#13646) ### Description This PR implements the **Concat Operator** for the **OneDNN Execution Provider**. ### Motivation and Context - As part of evaluating ORT performance on ARM based targets such as Graviton3, we discovered that the OneDNN EP had some gaps on operator coverage. - The Concat Operator is fairly common and used in models such as Yolov5, MobileNet, DistillBert and GPT2 - For Yolov5 specifically, this improves average inference time over 100 runs on Graviton3 from 180.2ms to 115.5ms when using OneDNN + ARM Compute Library. Co-authored-by: Gaz Iqbal <giqbal@octoml.ai>

References

#13646 - [oneDNN] Implemented Concat Op

Author

gaziqbal

Parents

c2d08fd7

onnxruntime b9702587 - [oneDNN] Implemented Concat Op (#13646)

onnxruntime
b9702587 - [oneDNN] Implemented Concat Op (#13646)