[9/N] [Dispatchable Collectives] Update reduce_scatter with CPU / CUDA implementations (#86166)
### Changes
- Updates for the reduce_scatter collective
### Context
https://github.com/pytorch/pytorch/issues/86225
Pull Request resolved: https://github.com/pytorch/pytorch/pull/86166
Approved by: https://github.com/kwen2501