[Blog] Muon Optimizer Support in DeepSpeed (#7962)
Authors: @PKUWZP & @delock
Blog post introducing Muon optimizer support in DeepSpeed, covering how
it integrates with ZeRO Stage 2/3, measured convergence and memory
results, and the roadmap ahead.
---------
Signed-off-by: Ma, Guokai <guokai.ma@intel.com>
Signed-off-by: Ma, Guokai <guokai.ma@gmail.com>
Signed-off-by: Guokai Ma <guokai.ma@intel.com>