DeepSpeed
allow seperate learning rate "muon_lr" and "adam_lr" for muon optimizer
#7658
Merged

allow seperate learning rate "muon_lr" and "adam_lr" for muon optimizer #7658

delock merged 5 commits into master from gma/muon_sep_lr
delock
delock delock requested a review from tjruwase tjruwase 63 days ago
delock delock requested a review from tohtana tohtana 63 days ago
delock allow seperate learning rate "muon_lr" and "adam_lr" for muon optimizer
c94b8226
PKUWZP PKUWZP requested a review from PKUWZP PKUWZP 62 days ago
PKUWZP
PKUWZP approved these changes on 2025-10-30
delock
sfc-gh-truwase Merge branch 'master' into gma/muon_sep_lr
b813c937
delock Merge branch 'master' into gma/muon_sep_lr
df0d9132
loadams
loadams approved these changes on 2025-11-05
delock Merge branch 'master' into gma/muon_sep_lr
c2b7c2b5
delock Merge branch 'master' into gma/muon_sep_lr
65bc4853
delock delock merged df59f203 into master 51 days ago
delock delock deleted the gma/muon_sep_lr branch 51 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone