DeepSpeed
49b6a632 - Reducing the memory-overhead of creating model for multi-GPU run (#1244)

Commit
4 years ago
Reducing the memory-overhead of creating model for multi-GPU run (#1244) Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Parents
Loading