DeepSpeed
_exec_forward_pass: place zeros(1) on the same device as the param
#5576
Merged
