DeepSpeed
optimize grad_norm calculation in stage3.py
#4436
Merged

optimize grad_norm calculation in stage3.py #4436

mmhab
mmhab optimize grad_norm calculation
a56838a2
mmhab mmhab requested a review from jeffra jeffra 2 years ago
mmhab mmhab requested a review from tjruwase tjruwase 2 years ago
mmhab mmhab requested a review from samyam samyam 2 years ago
mmhab mmhab requested a review from mrwyattii mrwyattii 2 years ago
tjruwase
tjruwase commented on 2023-10-03
tjruwase Merge branch 'master' into optimize_grad_norm_calc
671de1cd
tjruwase Merge branch 'master' into optimize_grad_norm_calc
ecfda40d
tjruwase Merge branch 'master' into optimize_grad_norm_calc
62d52aa8
tjruwase
tjruwase approved these changes on 2023-12-13
mmhab Merge branch 'master' into optimize_grad_norm_calc
d78ee555
mmhab Merge branch 'master' into optimize_grad_norm_calc
6ac9f7d0
mmhab
mrwyattii Merge branch 'master' into optimize_grad_norm_calc
f180a44d
mrwyattii
mrwyattii
mrwyattii commented on 2023-12-18
tjruwase
tjruwase commented on 2023-12-18
mrwyattii Update deepspeed/runtime/zero/stage3.py
93461bb7
mrwyattii Update deepspeed/runtime/zero/stage3.py
038d7555
mrwyattii Merge branch 'master' into optimize_grad_norm_calc
8cca05b8
tjruwase Merge branch 'master' into optimize_grad_norm_calc
799e9a24
mmhab optimize grad_norm calculation
7ef1be9a
mmhab
Merge branch 'master' into optimize_grad_norm_calc
753b3533
tjruwase tjruwase merged ea0d8114 into master 1 year ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone