DeepSpeed
cfead551 - fixes #2389 (#2411)

Commit
2 years ago
fixes #2389 (#2411) truncating expert param storage for checkpointing Co-authored-by: Alexander Jipa <azzhipa@amazon.com> Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>
Author
Alexander Jipa
Parents
Loading