DeepSpeed
Put Muon optimizer momentum buffer on GPU
#7648
Merged

delock merged 6 commits into master from gma/muon_opti
delock make muon optimizer totally running on GPU
bbb4bfcf
delock requested a review from tjruwase 69 days ago
delock requested a review from tohtana 69 days ago
delock apply torch.compile to Muon optimizer
e02c0ec9
delock make torch.compile more adaptive to old pytorch version
632ab6b2
PKUWZP requested a review from PKUWZP 59 days ago
delock Merge branch 'master' into gma/muon_opti
3cbb63ce
PKUWZP approved these changes on 2025-11-25
delock enabled auto-merge (squash) 36 days ago
delock Merge branch 'master' into gma/muon_opti
dea84926
delock Fix trailing space
3b6a6d92
delock merged 7f2f4232 into master 36 days ago
delock deleted the gma/muon_opti branch 36 days ago