pytorch
f0a55007 - [torch/elastic] Add logging to the sanitize function of RendezvousStateHolder (#58169)

Commit
3 years ago
[torch/elastic] Add logging to the sanitize function of RendezvousStateHolder (#58169)

Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/58169

This PR adds logging to the `_sanitize()` function of `RendezvousStateHolder` to output the nodes that had no recent heartbeat and are considered "dead".

ghstack-source-id: 128798389

Test Plan: Run the existing tests.

Reviewed By: tierex

Differential Revision: D28333394

fbshipit-source-id: ba0a398a759815e4224b58323c0e743eb383f723
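
For illustration only, the behavior described above (dropping nodes without a recent heartbeat and logging which ones were dropped) might look roughly like the sketch below. This is not the code from the commit: the function name `sanitize`, its parameters, the logger name, and the log message wording are all hypothetical, and the expiry rule (a fixed number of missed keep-alive intervals) is an assumption.

```python
import logging
from datetime import datetime, timedelta
from typing import Dict, List

# Hypothetical logger name, chosen for this sketch only.
log = logging.getLogger("rendezvous_sketch")


def sanitize(
    last_heartbeats: Dict[str, datetime],
    now: datetime,
    keep_alive_interval: timedelta,
    max_missed: int = 3,
) -> List[str]:
    """Remove nodes with no recent heartbeat and log which ones were removed.

    Assumption: a node counts as "dead" once its last heartbeat is older
    than `keep_alive_interval * max_missed`.
    """
    expire_time = now - keep_alive_interval * max_missed

    # Collect first, then delete, so the dict is not mutated while iterating.
    dead_nodes = sorted(
        node for node, last in last_heartbeats.items() if last < expire_time
    )
    for node in dead_nodes:
        del last_heartbeats[node]

    # Log only when something was actually removed.
    if dead_nodes:
        log.debug(
            "Detected %d dead node(s) with no recent heartbeat: %s",
            len(dead_nodes),
            ", ".join(dead_nodes),
        )
    return dead_nodes
```

Returning the list of removed nodes is a choice made here so the sketch is easy to test; the log call is the only part that corresponds to what the commit message describes.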