DeepSpeed
49b6a632 - Reducing the memory-overhead of creating model for multi-GPU run (#1244)

Commit

4 years ago

Reducing the memory-overhead of creating model for multi-GPU run (#1244) Co-authored-by: Jeff Rasley <jerasley@microsoft.com>

References

Author

RezaYazdaniAminabadi

Parents