Use math.fsum for duration aggregation in SimpleProfiler (#21525)
* Update test_mlflow.py
* test_mlflow.py
* Optimize SimpleProfiler duration aggregation
* re run ci
* Add tests for SimpleProfiler extended report functionality
---------
Co-authored-by: bhimrazy <bhimrajyadav977@gmail.com>