DeepSpeed
379e6b82
- Add Muon pretraining convergence advantage to What is Muon section
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
31 days ago
Add Muon pretraining convergence advantage to What is Muon section Signed-off-by: Ma, Guokai <guokai.ma@gmail.com>
References
#7962 - [Blog] Muon Optimizer Support in DeepSpeed
Author
delock
Committer
delock
Parents
be3cb5d3
Loading