DeepSpeed
Handle parameter groups smaller than DP
#273
Merged

Commits
  • Load non-DeepSpeed checkpoints into ZeRO optimizer
    tjruwase committed 5 years ago
  • Handle parameters smaller than DP
    tjruwase committed 5 years ago
  • Formatting fixes
    tjruwase committed 5 years ago
  • Merge branch 'master' into olruwase/zero_relevance_bug
    tjruwase committed 5 years ago
Loading