DeepSpeed
Fix ZeRO stage 1 and add stage 2 support with DeepCompile
#7366
Merged

Fix ZeRO stage 1 and add stage 2 support with DeepCompile #7366

loadams merged 20 commits into master from tohtana/dc_z1_no_sync
tohtana
keep real inputs for partial recompilation
5921295c
tohtana Merge branch 'master' into tohtana/keep_real_inputs_for_recompile
ab9aad51
fix format
a7897cf2
keep gradient through gradient accumulation period
653cfded
rename functions to consolidate z1 and z2
6d9fedbe
add zero2
333a385f
add common functions
cdb54fdb
Merge branch 'master' of github.com:deepspeedai/DeepSpeed
0a2a71af
Merge branch 'master' into tohtana/dc_z0_no_sync
06237f05
tohtana tohtana requested a review from tjruwase tjruwase 211 days ago
tohtana tohtana requested a review from loadams loadams 211 days ago
tohtana tohtana requested a review from jomayeri jomayeri 211 days ago
tohtana Merge branch 'master' into tohtana/dc_z1_no_sync
021f3565
stas00 Merge branch 'master' into tohtana/dc_z1_no_sync
2a12ceac
Merge branch 'master' into tohtana/dc_z1_no_sync
1c44b8e5
fix release of ipg buffer
ae744bdd
remove name of unused variable
d77ae129
fix format
2d013237
Merge branch 'tohtana/fix_zero_bucket' into tohtana/dc_z1_no_sync
1829ec71
tohtana Merge branch 'master' into tohtana/dc_z1_no_sync
b6c70f92
Merge branch 'tohtana/dc_z1_no_sync' of github.com:deepspeedai/DeepSp…
33455cfb
loadams Merge branch 'master' into tohtana/dc_z1_no_sync
1a94fbf8
loadams Merge branch 'master' into tohtana/dc_z1_no_sync
44620054
loadams
loadams approved these changes on 2025-06-27
loadams loadams enabled auto-merge (squash) 200 days ago
loadams loadams merged be8124c8 into master 200 days ago
loadams loadams deleted the tohtana/dc_z1_no_sync branch 200 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone