DeepSpeed
Fix potential memory issues when use deepspeed Z3
#6726
Merged


loadams merged 9 commits into deepspeedai:master from wenbinc-Bin:fix_z3
wenbinc-Bin requested a review from tjruwase 1 year ago
tjruwase commented on 2024-11-12
tjruwase commented on 2024-11-12
tjruwase requested a review from tohtana 1 year ago
wenbinc-Bin force pushed from 12189feb to 93ef46ad 1 year ago
wenbinc-Bin requested a review from loadams 1 year ago
wenbinc-Bin force pushed from 93ef46ad to 7cd4f335 1 year ago
wenbinc-Bin Set "ds_grads_remaining" to 0 when module doesn't have this variable
d75d1f5c
wenbinc-Bin Set "__n_available_params" to 0 in release_and_reset_all()
b83df91c
wenbinc-Bin Add unit stage3 test for running model twice in one step
9f0d2aee
wenbinc-Bin force pushed from 7cd4f335 to 9f0d2aee 1 year ago
tjruwase approved these changes on 2024-11-18
tjruwase Merge branch 'master' into fix_z3
ad60f943
loadams Merge branch 'master' into fix_z3
8b30987d
loadams Merge branch 'master' into fix_z3
7834887a
loadams Merge branch 'master' into fix_z3
51a69c07
loadams enabled auto-merge 1 year ago
wenbinc-Bin Fix dtype error
7cabf80a
disabled auto-merge 1 year ago
Head branch was pushed to by a user without write access
hwchen2017 Merge branch 'master' into fix_z3
ed40ac77
loadams merged cd20a3bb into master 1 year ago
