pytorch
7245d2c9 - Avoid scatter for single-device case in DDP (#46304)

Commit

4 years ago

Avoid scatter for single-device case in DDP (#46304) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/46304 In the case that a single process operates only on one GPU, we can avoid this scatter and instead replace it with a recursive version of `to` which transfers the input tensors to the correct device. The implementation of `_recursive_to` is modeled after `scatter` in https://github.com/pytorch/pytorch/blob/master/torch/nn/parallel/scatter_gather.py, in order to keep parity with the previous conventions (i.e. custom types not having their tensors moved). ghstack-source-id: 114896677 Test Plan: Added unittest, and CI Reviewed By: pritamdamania87 Differential Revision: D24296377 fbshipit-source-id: 536242da05ecabfcd36dffe14168b1f2cf58ca1d

Author

rohan-varma

Committer

facebook-github-bot

Parents

e5a2ba2e

Files3

torch
- nn/parallel
  - distributed.py
  - scatter_gather.py
- testing/_internal/distributed
  - distributed_test.py

pytorch 7245d2c9 - Avoid scatter for single-device case in DDP (#46304)

pytorch
7245d2c9 - Avoid scatter for single-device case in DDP (#46304)