DeepSpeed
stage3.py: do not scale if gradient_predivide_factor is 1.0
#3630
Merged

stage3.py: do not scale if gradient_predivide_factor is 1.0 #3630

guoyejun
guoyejun stage3.py: do not scale if gradient_predivide_factor is 1.0
acbd6701
guoyejun guoyejun requested a review from jeffra jeffra 2 years ago
guoyejun guoyejun requested a review from tjruwase tjruwase 2 years ago
guoyejun guoyejun requested a review from samyam samyam 2 years ago
guoyejun guoyejun requested a review from mrwyattii mrwyattii 2 years ago
tjruwase
tjruwase approved these changes on 2023-05-30
tjruwase Merge branch 'master' into noscalefor1.0
4a4c6b9b
tjruwase tjruwase merged 52907a66 into master 2 years ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone