DeepSpeed
Add Gram Newton-Schulz orthogonalization for Muon optimizer
#7953
Merged

Add Gram Newton-Schulz orthogonalization for Muon optimizer #7953

PKUWZP merged 10 commits into master from gma/gram_muon
delock
delock delock requested a review from tjruwase tjruwase 54 days ago
delock delock requested a review from tohtana tohtana 54 days ago
delock delock requested a review from loadams loadams 54 days ago
chatgpt-codex-connector
chatgpt-codex-connector commented on 2026-04-03
delock delock force pushed from 4fe6a0b4 to d17212ef 54 days ago
delock Add Gram Newton-Schulz iteration for Muon optimizer
381d8b7a
delock docs: add ns_method parameter to Muon optimizer documentation
e9beb2de
delock fix: correct Gram Newton-Schulz reference URL
d17212ef
delock Use accelerator API for dtype selection in Newton-Schulz iterations
54930203
delock Fix non-contiguous tensor output from Gram NS for tall matrices
e5de42ce
delock Fold transpose into matmul in Gram NS for tall matrices
a6cf6b69
delock Use fused addmm and eliminate eye allocation in Gram NS
61095611
PKUWZP PKUWZP requested a review from PKUWZP PKUWZP 53 days ago
delock
delock Merge branch 'master' into gma/gram_muon
dbc9ac9b
delock Merge branch 'master' into gma/gram_muon
83f18009
PKUWZP
PKUWZP approved these changes on 2026-04-30
PKUWZP Merge branch 'master' into gma/gram_muon
a6b0c704
PKUWZP PKUWZP merged 8a77f381 into master 27 days ago
PKUWZP PKUWZP deleted the gma/gram_muon branch 27 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone