DeepSpeed
de473091 - [Blog] Muon Optimizer Support in DeepSpeed (#7962)

Commit
44 days ago
Author: @PKUWZP & @delock

Blog post introducing Muon optimizer support in DeepSpeed, covering how it integrates with ZeRO Stage 2/3, measured convergence and memory results, and the roadmap ahead.

---------

Signed-off-by: Ma, Guokai <guokai.ma@intel.com>
Signed-off-by: Ma, Guokai <guokai.ma@gmail.com>
Signed-off-by: Guokai Ma <guokai.ma@intel.com>