Put Muon optimizer momentum buffer on GPU #7648
make muon optimizer totally running on GPU
bbb4bfcf
apply torch.compile to Muon optimizer
e02c0ec9
make torch.compile more adaptive to old pytorch version
632ab6b2
Merge branch 'master' into gma/muon_opti
3cbb63ce
PKUWZP
approved these changes
on 2025-11-25
delock
enabled auto-merge (squash) 36 days ago
Merge branch 'master' into gma/muon_opti
dea84926
Fix trailing space
3b6a6d92
delock
merged
7f2f4232
into master 36 days ago
delock
deleted the gma/muon_opti branch 36 days ago