Release 1.10.0 cherry pick round 1 (#9886)
* Fix memset size (#9840)
(cherry picked from commit d012d9fb913230205620771c94cb6f5cb89a4203)
* [js/web] do not use nodejs type 'Buffer' in web (#9839)
* [js/web] do not use nodejs type 'Buffer' in web
* resolve comments and validate tests
* remove 'Buffer' in test
(cherry picked from commit a3ebc5e08290b4000e8509d21819d7f486e23a14)
* Fix potential data race with OrtValue usage in Python (#9841)
(cherry picked from commit 18fd2cf4571061b61115b47b524ae6828057f95e)
* [OpenVINO-EP] V3.4 Release with OpenVINO 2021.4.2 LTS Release (#9848)
* Changes to ensure openvino build go through in Windows
* Modified Hetero plugin Logic
*Modified Hetero Feature logic. In Hetero,
if the operator to be marked true in getcapability(),
it should be supported by either of the devices
specified with HETERO in the device_type.
Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>
* OV updated to 2021.4.2 version
* OV updated to 2021.4.2 version
* Updated OV to 2021.4.2 version, mono download link and dotnet version
* Copying Managed nugets in openvino c# docker file
*Copying Managed nuget to nugets artifacts
directory
Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com>
Co-authored-by: saharfraza <sfatima.3001@gmail.com>
Co-authored-by: mayavijx <mayax.vijayan@intel.com>
Co-authored-by: Aravind Gunda <aravindx.gunda@intel.com>
(cherry picked from commit 0ae0f29f140c5d7b4077df024da75abd30367e58)
* no fallback when enforcing explicit EP registration. (#9863)
* no fallback when enforcing explicit EP registration.
* add explicit ep registrations for python.
(cherry picked from commit 1e9e57df3efddc2fc41540e79149e670ac5304c6)
* layernorm throw error if input has no data (#9837)
(cherry picked from commit bf716e667c43d5091af5bd2418eb7bdbcb9d4c23)
* [js/node] npm audit fix (#9861)
(cherry picked from commit 27e337ed7a0ca55126ac2f3a250a5ab16e54ea63)
* [python manylinux package] emit warning if missing CUDA/TensorRT dependency causes ld_preload to fail and user tries to register either CUDA/TensorRT EP (#9872)
* add warning if ld_preload fails for CUDA or TRT when trying to register either provider
* refactor
* change wording from register to create
(cherry picked from commit ec9b0ed800650113a35be5ecf2d68394dbfe6ad1)
* QDQ tool modification part2 (#9720)
* Add finetuned qdq options
* Add description
* Add unit tests
* Modify for channel axis
* Remove too specific feature. Move this implementation to e2e example
* Add OpTypesSupportPerChannelQuantization
* fix bug for unit test
* Keep flags OpTypesSupportPerChannelQuantization and QDQChannelAxis for internal use
Will have a follow-up PR to fine tune the code
* remove unnecessary warning
Co-authored-by: stevenlix <38092805+stevenlix@users.noreply.github.com>
Co-authored-by: Yufeng Li <liyufeng1987@gmail.com>
(cherry picked from commit 0baf687f2df01bab8f3160c464b26ac383d9dff3)
* Cancel transpose optimizer for resize (#9870)
* cancel transpose optimizer for resize
* add UT
* addressing comments
* fix build err
(cherry picked from commit 16bfd3c771c417fa047aa0e551dc493818c2bb4c)
* Add build option to enable cuda profiling (#9875)
(cherry picked from commit 9345894c823de67f913d907da91dbc07ac322041)
Co-authored-by: Dmitri Smirnov <yuslepukhin@users.noreply.github.com>
Co-authored-by: Yulong Wang <7679871+fs-eire@users.noreply.github.com>
Co-authored-by: Hariharan Seshadri <shariharan91@gmail.com>
Co-authored-by: Maajid khan <n.maajidkhan@gmail.com>
Co-authored-by: George Wu <jywu@microsoft.com>
Co-authored-by: Ye Wang <52801275+wangyems@users.noreply.github.com>
Co-authored-by: Chi Lo <54722500+chilo-ms@users.noreply.github.com>
Co-authored-by: RandySheriffH <48490400+RandySheriffH@users.noreply.github.com>