DeepSpeed
fix(zero): Ensure full gradient reduction for Muon optimizer with reduce_scatter
#7878
Open

fix(zero): Ensure full gradient reduction for Muon optimizer with reduce_scatter #7878

nathon-lee wants to merge 6 commits into deepspeedai:master from nathon-lee:fix_cp_7807
nathon-lee
nathon-lee fix: Ensure full gradient reduction for Muon with reduce_scatter
1dc41225
nathon-lee Update stage_1_and_2.py
a873854e
tohtana Fix ZeRO stage to choose BF16 optimizer in test (#7803)
f6ddd754
nathon-lee Update stage_1_and_2.py
15996a95
nathon-lee Merge branch 'deepspeedai:master' into fix_cp_7807
196c7ae5
nathon-lee Merge branch 'deepspeedai:master' into fix_cp_7807
4665aa93
nathon-lee nathon-lee requested a review from tjruwase tjruwase 11 days ago
nathon-lee nathon-lee requested a review from tohtana tohtana 11 days ago
chatgpt-codex-connector
chatgpt-codex-connector commented on 2026-02-27
tjruwase tjruwase requested a review from PKUWZP PKUWZP 8 days ago
nathon-lee nathon-lee changed the title Fix cp 7807 fix(zero): Ensure full gradient reduction for Muon optimizer with reduce_scatter 4 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone