Fix TRT EP's CUDA graph feature (#17355)
When users run inference with CUDA graph enabled in a multithreaded scenario, only the main thread that creates the inference session initializes the CUDA graph instance. Other threads that call inference run directly hit a segfault because the CUDA graph instance is never allocated/initialized for them.
This PR fixes this issue.
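
A minimal repro sketch of the scenario described above, from the Python API side. The provider option name (`trt_cuda_graph_enable`), model path, and input name/shape are illustrative assumptions, not taken from this PR; real CUDA graph usage also typically binds fixed input/output buffers via IOBinding, which is omitted here for brevity.

```python
# Illustrative multithreaded inference sketch (assumed model/input names).
import threading
import numpy as np
import onnxruntime as ort

providers = [
    ("TensorrtExecutionProvider", {"trt_cuda_graph_enable": True}),
    "CUDAExecutionProvider",
]

# The session is created on the main thread, which is the only thread that
# previously got a properly initialized CUDA graph instance.
sess = ort.InferenceSession("model.onnx", providers=providers)

def worker():
    # Before this fix, threads other than the session-creating thread could
    # segfault here because their CUDA graph instance was never allocated.
    x = np.random.rand(1, 3, 224, 224).astype(np.float32)
    sess.run(None, {"input": x})

threads = [threading.Thread(target=worker) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()
```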