Update Torch-TRT latency report (#2006)
Summary:
- Ensure only one latency is reported per model, to avoid clashes in timing between CPU and GPU timers
- Reduce unintended latency increases from the instrumentation due to timing
Pull Request resolved: https://github.com/pytorch/benchmark/pull/2006
Reviewed By: aaronenyeshi
Differential Revision: D50600481
Pulled By: xuzhao9
fbshipit-source-id: dd256b934c6d575c1e4ab29ec25e7f767f8da2b1