DeepSpeed
54110305 - Inference Checkpoints in V2 (#4664)

Commit
2 years ago
Inference Checkpoints in V2 (#4664)

Add capability to snapshot an engine and resume from it, reducing load times for large models. Includes new unit tests to validate this pipeline on a small scale.

---------

Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>
Co-authored-by: Ammar Ahmad Awan <ammar.awan@microsoft.com>
Co-authored-by: Masahiro Tanaka <mtanaka@microsoft.com>
Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>
Co-authored-by: Reza Yazdani <44502768+RezaYazdaniAminabadi@users.noreply.github.com>
Co-authored-by: Reza Yazdani <reyazda@microsoft.com>