DeepSpeed
7e480ea0 - offload++ blog (#623) (#4637)

Commit
1 year ago
This PR adds the blog post for ZeRO-Offload++. It describes how our new Twin-Flow feature works and reports its performance numbers on both DGX-A100 and DGX-H100 machines.

The corresponding code PR is https://github.com/microsoft/DeepSpeed/pull/4636

cc @jeffra @awan-10 @tjruwase @mrwyattii

---------

Co-authored-by: Jeff Rasley <jerasley@microsoft.com>