Multiply lr scheduler steps by `num_processes`. #3983
Multiply lr scheduler steps by `num_processes`.
b32bcb8e
sayakpaul
approved these changes
on 2023-07-07
Stop multiplying steps by gradient accumulation.
660c5519
muellerzr
approved these changes
on 2023-07-13
sayakpaul
merged
ece55227
into main 2 years ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub