onnxruntime
c8789d30 - [ROCm] static re-hipify of CUDA EP to ROCm EP, now a shared provider (#8877)

Commit
4 years ago
[ROCm] static re-hipify of CUDA EP to ROCm EP, now a shared provider (#8877) * re-hipify all rocm EP sources * fix all other files affected by re-hipify * add cuda_provider_factory.h to amd_hipify.py * do not use cudnn_conv_algo_search in ROCm EP, missing reduce min registration * Fix ReduceConsts template specialization introduced in #9101. Fixes the error when building for ROCm 4.3.1: error: too many template headers for onnxruntime::rocm::ReduceConsts<__half>::One (should be 0) * fix flake8 error in amd_hipify.py * speed up hipify with concurrent.futures * flake8 fix in amd_hipify.py
Author
Parents
Loading