ONNX Runtime 1.15.1 release (#16365)
### Description
This patch release fixes the following issues:
1. test/shared_lib/test_inference.cc compared float values with
ASSERT_EQ instead of ASSERT_NEAR. Without this change, some
DNNL/OpenVINO tests fail on some AMD CPUs.
2. A misalignment error in the cublasGemmBatchedHelper function. The
error only occurs when the GPU's CUDA compute capability is 8.0 or
higher (in other words, on GPUs with TensorFloat-32 support).
3. A build break: building with onnxruntime_ENABLE_MEMORY_PROFILE was
broken in the 1.15.0 release.
4. The native onnxruntime library failed to load in Azure App Service.
This happened because 1.15.0 introduced a call to the Windows API
SetThreadDescription. Although the API is available in all Windows 10
versions, some sandbox environments block its use.
5. An alignment problem in the XNNPACK EP on Intel/AMD CPUs on PC
platforms.
6. Some training header files were missing from the 1.15.0 training
NuGet package.
7. Some fields in the OrtCUDAProviderOptionsV2 struct were not
initialized.
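
As an illustration of item 1, here is a minimal sketch of why a tolerant
comparison is needed for float outputs. It does not reproduce the actual
test code: NearlyEqual, AllNear, Example, and the 1e-5 tolerance are
hypothetical names and values chosen for this sketch. NearlyEqual mirrors
the absolute-error check that GoogleTest's ASSERT_NEAR performs.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper (not part of ONNX Runtime or GoogleTest): true when
// |a - b| <= abs_tol, the same absolute-error bound ASSERT_NEAR applies.
inline bool NearlyEqual(float a, float b, float abs_tol) {
  return std::fabs(a - b) <= abs_tol;
}

// Compares two output buffers element-wise with an absolute tolerance.
inline bool AllNear(const std::vector<float>& expected,
                    const std::vector<float>& actual, float abs_tol) {
  if (expected.size() != actual.size()) return false;
  for (std::size_t i = 0; i < expected.size(); ++i) {
    if (!NearlyEqual(expected[i], actual[i], abs_tol)) return false;
  }
  return true;
}

// Assumed scenario: a result that differs from the reference only by a
// small rounding error, as different CPUs or kernels may produce.
inline void Example() {
  const std::vector<float> expected{1.0f, 2.0f};
  const std::vector<float> actual{1.0f + 1e-6f, 2.0f};
  assert(AllNear(expected, actual, 1e-5f));  // tolerant comparison passes
  assert(expected[0] != actual[0]);          // exact equality would fail
}
```

The tolerance should be chosen per test; an absolute bound works for
values of known magnitude, while outputs spanning many orders of
magnitude may need a relative bound instead.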
---------
Co-authored-by: cao lei <jslhcl@gmail.com>
Co-authored-by: Lei Cao <leca@microsoft.com>
Co-authored-by: Scott McKay <skottmckay@gmail.com>
Co-authored-by: Baiju Meswani <bmeswani@microsoft.com>
Co-authored-by: JiCheng <wejoncy@163.com>
Co-authored-by: Yuriy Chernyshov <thegeorg@yandex-team.com>
Co-authored-by: Artur <artur@vaadin.com>
Co-authored-by: Dale Phurrough <dale@hidale.com>
Co-authored-by: Yi Zhang <zhanyi@microsoft.com>