Reduce profiling runs on distributed models (#1243)
Summary:
Pull Request resolved: https://github.com/pytorch/benchmark/pull/1243
Many of the models are fairly large, and 10 iterations is way too much
(chrometrace take sa long time to load and freezes). 2 is probably
enough for most applications and even on the largest models the results
are reasonable.
Test Plan: Imported from OSS
Reviewed By: xuzhao9
Differential Revision: D40705586
Pulled By: davidberard98
fbshipit-source-id: b188e9efe5013558cd681631edfdfb8d93d343fd