DeepSpeed
88c319aa - Handle parameter groups smaller than DP (#273)

Commit
5 years ago
Handle parameter groups smaller than DP (#273)

* Load non-DeepSpeed checkpoints into ZeRO optimizer
* Handle parameters smaller than DP
* Formatting fixes