DeepSpeed
Fix: only add parameter with grads to parameter group
#7869
Merged

Fix: only add parameter with grads to parameter group #7869

tjruwase merged 6 commits into master from gma/fix_muon_partial_training
delock
delock only add parameter with grads to parameter group
f0265ef5
delock delock requested a review from tjruwase tjruwase 15 days ago
delock delock requested a review from tohtana tohtana 15 days ago
chatgpt-codex-connector
chatgpt-codex-connector commented on 2026-02-22
PKUWZP PKUWZP requested a review from PKUWZP PKUWZP 15 days ago
PKUWZP
PKUWZP requested changes on 2026-02-22
delock
delock delock requested a review from loadams loadams 15 days ago
deepspeedai deepspeedai deleted a comment from PawnOfDelock on 2026-02-23
delock delock requested a review from PKUWZP PKUWZP 15 days ago
delock
delock Remove CUDA dependency
5290ed45
delock Use distributed test
d8a792c9
delock Merge branch 'master' into gma/fix_muon_partial_training
6bb70fbc
delock
delock Merge branch 'master' into gma/fix_muon_partial_training
bde4d321
delock delock changed the title only add parameter with grads to parameter group Fix: only add parameter with grads to parameter group 12 days ago
tjruwase
tjruwase approved these changes on 2026-03-01
tjruwase tjruwase enabled auto-merge (squash) 8 days ago
tjruwase Merge branch 'master' into gma/fix_muon_partial_training
5a744ddf
tjruwase tjruwase merged 116dbe28 into master 8 days ago
tjruwase tjruwase deleted the gma/fix_muon_partial_training branch 8 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone