Revert all_reduce workaround as it might be causing issues in other parts of the codebase (#96460)
The focal and bionic PTD tests on master have been broken since we switched to all_reduce in #95897.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/96460
Approved by: https://github.com/fegin