DeepSpeed
Update to new torch grad hook API: BF16Optimizer and Stage2
#7189
Merged


deepcharm
deepcharm requested a review from tjruwase 266 days ago
deepcharm requested a review from tohtana 266 days ago
deepcharm Avoid graph break by removing redundant requires_grad attr change
8d2ca5e9
deepcharm Small fix
5cf1933d
deepcharm Revert "Small fix"
4ac85311
fix leak of z3 buffer
8a883d7e
inkcherry hf tp+zero training doc. (#7151)
8e24314c
tohtana Add destroy to tests to free memory (#7160)
6cc4cdf9
c8ef [NFC] Typo fix in SP layer. (#7152)
acafeec8
hwchen2017 Link AutoTP blog in the front page (#7167)
e53ae846
stas00 fix `seq_parallel_communication_data_type` constant. (#7175)
36692e6a
loadams Fix typos in GDS blog (#7177)
1812b8ca
bm-synth Variable batch size and LR scheduler (#7104)
2cbb0715
loadams Update version.txt after 0.16.5 release (#7180)
c1d084e9
hwchen2017 Cross layer overlapping for domino (#7178)
1d218c83
inkcherry async tp allreduce (#7115)
ea4829bc
Glaceon-Hyy Fix issue #5242 grad_norm and loss is nan (#7171)
34f20e80
deepcharm Update to new torch grad hook API: BF16Optimizer and Stage2
28701b37
deepcharm force-pushed from ebdad97b to 28701b37 266 days ago
deepcharm requested a review from loadams 266 days ago
deepcharm requested a review from GuanhuaWang 266 days ago
deepcharm requested a review from hwchen2017 266 days ago
deepcharm Merge branch 'master' into use-new-grad-acc-api
88589deb
tjruwase approved these changes on 2025-03-31
loadams merged 79ff1627 into master 264 days ago
deepcharm deleted the use-new-grad-acc-api branch 189 days ago
