NV TensorRT RTX EP - initial commit (#24456)
Adds a new EP. It is currently based on the existing TensorRT EP but is
meant to be used on RTX GPUs with a lean version of TensorRT.
### Description
Adds a new EP based on the TensorRT EP. It uses a special version of
TensorRT optimized for RTX GPUs. In the future we plan to streamline the
EP further (e.g., by removing the dependency on the CUDA EP completely).
### Motivation and Context
The new TensorRT for RTX is going to have:
1. A much smaller footprint.
2. Much faster model compile/load times.
3. Better usability when reusing cached models across multiple RTX
GPUs.
This effort also targets WCR ML workflows.
---------
Co-authored-by: Maximilian Müller <maximilianm@nvidia.com>
Co-authored-by: Gaurav Garg <gaugarg@nvidia.com>
Co-authored-by: iraut <iraut@nvidia.com>
Co-authored-by: Hrishikesh Manohar <hrishikeshm@nvidia.com>
Co-authored-by: Maximilian Müller <44298237+gedoensmax@users.noreply.github.com>