[Distributed] Switch all_reduce to use the new functional collective op (#6887)
PyTorch has implemented a new set of functional collective ops and plans to remove the old ops. This change migrates all_reduce to the new functional op.
For context, see pytorch/pytorch#93173 (comment).