DeepSpeed
stage3: efficient compute of scaled_global_grad_norm
#5256
Merged

nelyahu
stage3: efficient compute of scaled_global_grad_norm
eebeaf3d
nelyahu requested a review from tjruwase 1 year ago
nelyahu requested a review from mrwyattii 1 year ago
tjruwase Merge branch 'master' into stage_3_scaled_global_norm_calc
c353b9ef
tjruwase approved these changes on 2024-04-12
tjruwase
Fix formatting: remove unused import
e559d629
nelyahu
tjruwase Merge branch 'master' into stage_3_scaled_global_norm_calc
ff9f46ab
tjruwase merged 54c06872 into master 1 year ago
nelyahu deleted the stage_3_scaled_global_norm_calc branch 1 year ago
