DeepSpeed
b08cf416 - skip torch.zeros and tensor.copy_ when model parallel is not used (#2479)

Commit
3 years ago
skip torch.zeros and tensor.copy_ when model parallel is not used (#2479) Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Author
Parents
Loading