pytorch
3a38f175 - Convert DDP parameters to ReplicatedTensor during forward pass.

Commit

4 years ago

Convert DDP parameters to ReplicatedTensor during forward pass. Pull Request resolved: https://github.com/pytorch/pytorch/pull/75753 As per the design in https://github.com/pytorch/pytorch/issues/72138, convert DDP parameters to ReplicatedTensor during its forward pass. Concretely, this is done as follows: 1) Create a separate `_replicated_tensor_module` which is a copy of self.module without creating copies of the Tensors themselves. 2) Use `_replicated_tensor_module` instead of `self.module` during the forward pass. 3) Have a context manager `_ddp_replicated_tensor` to enable this, since certain edge cases can fail where self.module is changed out of band resulting in discrepancy between self.module and `_replicated_tensor_module`. Differential Revision: [D35533736](https://our.internmc.facebook.com/intern/diff/D35533736/) Approved by: https://github.com/wanchaol, https://github.com/rohan-varma

Author

pritamdamania

Committer

pytorchmergebot

Parents

f4d89aa2

pytorch 3a38f175 - Convert DDP parameters to ReplicatedTensor during forward pass.

pytorch
3a38f175 - Convert DDP parameters to ReplicatedTensor during forward pass.