fix iteration timing used in autotuning when gradient_accumulation_steps > 1 (#2888)
* fix iteration timing when gas > 1
* fix formatting
---------
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>