DeepSpeed
Fix local rank mismatch error when training on nodes with different number of GPUs
#3409
Merged

Fix local rank mismatch error when training on nodes with different number of GPUs #3409

byungsoo-oh
byungsoo-oh Fix local rank mismatch for heterogeneous nodes
a2975c2a
byungsoo-oh byungsoo-oh requested a review from jeffra jeffra 2 years ago
byungsoo-oh byungsoo-oh requested a review from awan-10 awan-10 2 years ago
loadams Merge branch 'master' into fix-local-rank
a25c2d62
loadams Merge branch 'master' into fix-local-rank
3b1b115e
loadams Merge branch 'master' into fix-local-rank
d5c0cc70
loadams Merge branch 'master' into fix-local-rank
77763761
byungsoo-oh Merge branch 'master' into fix-local-rank
7cefc185
byungsoo-oh
loadams Merge branch 'master' into fix-local-rank
dcac9d8c
tjruwase Merge branch 'master' into fix-local-rank
5d8d5d03
loadams Merge branch 'master' into fix-local-rank
8534355b
byungsoo-oh
loadams Merge branch 'master' into fix-local-rank
945e9317
loadams
loadams approved these changes on 2023-06-06
mrwyattii
mrwyattii approved these changes on 2023-06-06
mrwyattii mrwyattii merged b7f463dd into master 2 years ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone