Backport ZeRO1: Add bucketting logic to control the size of tensors for all-gather/reduce-scatter (#6025) #6806
ZeRO1: Add bucketting logic to control the size of tensors for all-ga…
f28946f9
lsy323
approved these changes
on 2024-03-25
lsy323
merged
a805505d
into r2.3 1 year ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub