pytorch
a33eac39 - [NCCL] Add experimental Nonblocking NCCL Fault Tolerance/Checking (#95715)

Commit
1 year ago
[NCCL] Add experimental Nonblocking NCCL Fault Tolerance/Checking (#95715) Support for nonblocking NCCL communicators/fault tolerance/checking which was added in 2.14 as an experimental feature. Enabled via the environment variable: ``` TORCH_NCCL_USE_COMM_NONBLOCKING=1 ``` CC @ptrblck Pull Request resolved: https://github.com/pytorch/pytorch/pull/95715 Approved by: https://github.com/kwen2501
Author
eqy eqy
Committer
Parents
Loading