Consolidate TensorRT subgraphs to reduce inference overhead (#11211)
* add trt node list consolidation
* add more log
* fix typo
* seperate cycle detection and removal
* update
* change function name
Co-authored-by: Ubuntu <azureuser@orttrtlinuxdev.bxgbzpva45kedp3rhbsbit4phb.jx.internal.cloudapp.net>