DeepSpeed
636e842e - cpu_offload enables overlap_comm and contiguous_gradients

Commit
5 years ago
cpu_offload enables overlap_comm and contiguous_gradients
Remove non-portable tensor.mul_()
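
The first change in the message can be pictured as a config-resolution step. The snippet below is only a minimal sketch under stated assumptions, not the code from this commit: the helper `apply_cpu_offload_defaults` is hypothetical, and the only names taken from the commit title are the flags `cpu_offload`, `overlap_comm`, and `contiguous_gradients`.

```python
# Hypothetical illustration only: this helper is NOT part of DeepSpeed.
# It mimics the behavior named in the commit title, where turning on
# cpu_offload also turns on overlap_comm and contiguous_gradients.

def apply_cpu_offload_defaults(zero_config):
    """Return a copy of the config with dependent flags forced on
    whenever cpu_offload is enabled."""
    resolved = dict(zero_config)  # copy so the caller's dict is untouched
    if resolved.get("cpu_offload", False):
        resolved["overlap_comm"] = True
        resolved["contiguous_gradients"] = True
    return resolved


# Example input: the user asks for cpu_offload but leaves overlap_comm off.
user_config = {"stage": 2, "cpu_offload": True, "overlap_comm": False}
resolved = apply_cpu_offload_defaults(user_config)
print(resolved)
```

In this sketch the dependent flags are overridden rather than validated, so a user setting of `overlap_comm: False` is silently replaced; whether the actual commit overrides or warns is not stated on this page.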