onnxruntime
d912167c - [NV TRT RTX EP] Cumulative TRT RTX EP merge (#25656)

Commit
290 days ago
[NV TRT RTX EP] Cumulative TRT RTX EP merge (#25656) This currently holds 2 major improvements: - dynamic shape models should have much lower memory usage and in addition to that the management is move towards ORT allocators - the overhead for shape binding and address updates is reduce per inference --------- Co-authored-by: Gaurav Garg <gaugarg@nvidia.com>
Author
Parents
Loading