Fix bf16 gradient norm divergence with ZeRO stage 0 #7839
Fix ZeRO-0 + bf16 broken training: disable loss scaling and fix zero_…
773c607b
add test
c5f60af0
Address PR feedback for issue #7837 loss-scale refactor
459860ce
tohtana merged 1752c2ab into master 55 days ago