ZeRO3: Improve mismatch detection #7525
Detect list len mismatches
641d86fa
Revert
b0b6bd6c
Z3 sanity check option
cbf3d661
Revert
ceef8756
Minor tweaks
1a11c187
Improve error message format
ffdccf26
Improve error message format
498e69c2
stas00
approved these changes
on 2025-08-29
Update deepspeed/runtime/zero/utils.py
d6b3b74d
Update deepspeed/runtime/engine.py
5aad5745
PR feedback
0b6145fc
Add list length
948f7775
Merge branch 'master' into sfc-gh-truwase/detect_z3_state_mismatch
05f1e970
sfc-gh-truwase
deleted the sfc-gh-truwase/detect_z3_state_mismatch branch 121 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub