DeepSpeed
Add ZenFlow code for Stage 3
#7516
Merged

Add ZenFlow code for Stage 3 #7516

JoshWoo2003
JoshWoo2003 JoshWoo2003 requested a review from tjruwase tjruwase 123 days ago
JoshWoo2003 JoshWoo2003 requested a review from tohtana tohtana 123 days ago
JoshWoo2003 JoshWoo2003 requested a review from loadams loadams 123 days ago
JoshWoo2003
JoshWoo2003 JoshWoo2003 force pushed from db2dfac6 to 133290e8 123 days ago
delock
delock commented on 2025-09-05
loadams
JoshWoo2003 JoshWoo2003 force pushed from d5508141 to 47b10d8d 93 days ago
JoshWoo2003
delock
JoshWoo2003 Add support for ZenFlow with ZeRO-Stage 3
77562756
JoshWoo2003 Add ZenFlow Optimizer to support ZeRO-Stage 3
eb6deca8
JoshWoo2003 Format code
ce78292e
JoshWoo2003 Fix ZeRO-Offload init bug with missing `zenflow` argument
d73cf438
JoshWoo2003 Resolve merge conflicts and update ZenFlow Stage 3 affinity
26cc5eca
JoshWoo2003 JoshWoo2003 force pushed from 4f4e7525 to 26cc5eca 89 days ago
JoshWoo2003
sfc-gh-truwase Merge branch 'master' into zenflow_zero3
c582cd22
sfc-gh-truwase
sfc-gh-truwase commented on 2025-09-30
sfc-gh-truwase
sfc-gh-truwase commented on 2025-09-30
sfc-gh-truwase
sfc-gh-truwase commented on 2025-09-30
JoshWoo2003 Refactor ZenFlowSelectiveAdamW to unify temp_copy_param logic
7a1fde97
JoshWoo2003 Merge remote-tracking branch 'origin/zenflow_zero3' into zenflow_zero3
956ef0a6
JoshWoo2003 Refactor ZenFlowSelectiveAdamW to unify group_step logic
2b27d50b
sfc-gh-truwase Merge branch 'master' into zenflow_zero3
e0b4e5bc
sfc-gh-truwase
sfc-gh-truwase commented on 2025-10-03
sfc-gh-truwase
sfc-gh-truwase commented on 2025-10-03
JoshWoo2003 Refactor: deduplicate process setup and add clarifying comments
a42c56f9
JoshWoo2003 Merge branch 'zenflow_zero3' of https://github.com/JoshWoo2003/DeepSp…
15bb92e3
sfc-gh-truwase Merge branch 'master' into zenflow_zero3
b4be176c
sfc-gh-truwase Merge branch 'master' into zenflow_zero3
2540646d
sfc-gh-truwase
sfc-gh-truwase approved these changes on 2025-10-07
sfc-gh-truwase Merge branch 'master' into zenflow_zero3
9d33ce0a
sfc-gh-truwase
JoshWoo2003 Fix: bugs in zf_torch_adam unit test
90eb5bfc
JoshWoo2003
sfc-gh-truwase Merge branch 'master' into zenflow_zero3
ef94a9ef
sfc-gh-truwase sfc-gh-truwase merged 7cb1b88e into master 75 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone