DeepSpeed
Refactor moe/non-moe gradient reduction
#1811
Merged

Commits
  • Refactor moe/non-moe gradient reduction
    tjruwase committed 4 years ago
  • Merge branch 'master' into olruwase/simplify_engine_reduction
    tjruwase committed 4 years ago