DeepSpeed
fix: to solve #4726
#4727
Merged

Commits
  • fix: solve the problem of different dtype of loss tensors across stages of pipeline parallel.
    RUAN-ZX committed 2 years ago
  • Merge branch 'master' into master-loss-dtype
    tjruwase committed 2 years ago
Loading