add known worker ids to distributed autograd context (#26324)
Summary:
Per https://github.com/pytorch/pytorch/issues/25525, we want to clean up the distributed autograd context on all nodes, not just the local one. To do this, we will send async RPCs to the other nodes telling them to clean up their copies of the context.
As a first step, a node's context needs to know which other workers it has communicated with. This PR does two things:
1) Adds the necessary data structures and getter functions to `DistAutogradContext`
2) Refactors calls to `addSendRpcBackward` to take in the `worker_id` as an additional argument
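The shape of the change can be sketched roughly as below; the class and method names are simplified for illustration and may not match the PR's exact signatures:

```cpp
#include <cassert>
#include <cstdint>
#include <unordered_set>

using worker_id_t = int16_t;

// Minimal sketch of a distributed autograd context that records which
// workers it has exchanged RPCs with, so that cleanup RPCs can later be
// sent to each of them.
class DistAutogradContext {
 public:
  explicit DistAutogradContext(int64_t contextId) : contextId_(contextId) {}

  // Refactored call site: the destination worker id is now passed in
  // whenever a send RPC's backward function is attached to this context,
  // so the context can remember the worker.
  void addSendRpcBackward(worker_id_t dstWorkerId /*, backward fn elided */) {
    knownWorkerIds_.insert(dstWorkerId);
    // ... attach the send backward function to the autograd graph ...
  }

  // Getter used during cleanup to enumerate every worker that must be
  // told to release this context.
  const std::unordered_set<worker_id_t>& getKnownWorkerIds() const {
    return knownWorkerIds_;
  }

  int64_t contextId() const { return contextId_; }

 private:
  int64_t contextId_;
  std::unordered_set<worker_id_t> knownWorkerIds_;
};
```

A set keeps the worker list deduplicated even when a node sends multiple RPCs to the same peer, so each worker receives at most one cleanup message.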
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26324
Differential Revision: D17769411
Pulled By: rohan-varma
fbshipit-source-id: b7327d1209a574e2e88cb197edff3103024d51ad