DeepSpeed
49b6a632
- Reducing the memory-overhead of creating model for multi-GPU run (#1244)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
4 years ago
Reducing the memory-overhead of creating model for multi-GPU run (#1244) Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
References
#1244 - Reducing the memory-overhead of creating large-models for multi-GPU run
Author
RezaYazdaniAminabadi
Parents
274c375c
Loading