Adding profiling capability to c++ ddp collective functions (#46471)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/46471
ghstack-source-id: 116018837
Test Plan:
Added unit tests:
buck test mode/dev-nosan caffe2/test/distributed:distributed_gloo_fork
buck test mode/dev-nosan caffe2/test/distributed:distributed_nccl_fork
Reviewed By: rohan-varma
Differential Revision: D23948397
fbshipit-source-id: 6d93a370aff26bf96c39e5d78a2492c5142a9156