DeepSpeed
ee4deabd - ZeRO2-Offload: Load balance gradient copying to CPU (#1067)

Commit
4 years ago
ZeRO2-Offload: Load balance gradient copying to CPU (#1067) * Round robin partitioning to improve ZeRO-2 Offload CPU copy * Formatting fixes * Fix index issues in debug dumps * Remove debug prints * Code cleanup * Remove unintended stage3.py changes * Add TODO
Author
Parents
Loading