pytorch
a80e5e78 - Update ideep for future performance improvement (#87966)

Commit

2 years ago

Update ideep for future performance improvement (#87966)

**Summary**
The update includes API changes and optimizations to reduce framework overhead, which will benefit all mkldnn (onednn) ops in JIT mode, the inductor CPU backend, etc. These benefits will be seen after switching to the new ideep API in future PRs.

**Test plan**
For correctness: all UTs that call mkldnn ops, including test_ops.py, test_mkldnn*.py, test_quantization.py, etc.
For performance: TorchBench has been run and no regression was found. Results are shown below, with the following setup.
- Intel (R) Xeon (R) IceLake with 40 cores
- Use multi-instance
- Using tcmalloc & Intel OMP

![image](https://user-images.githubusercontent.com/12522207/201631004-bb77468d-953b-4757-a001-94d44615b5f6.png)

Pull Request resolved: https://github.com/pytorch/pytorch/pull/87966
Approved by: https://github.com/jgong5, https://github.com/XiaobingSuper

Author

yanbing-j

Committer

pytorchmergebot

Parents

31708a73
