xla
Enable bucketized all-reduce for gradients
#7216
Merged

Enable bucketized all-reduce for gradients #7216

JackCaoG merged 11 commits into master from bucketing_gradients
amithrm
JackCaoG JackCaoG requested a review from jeffhataws jeffhataws 1 year ago
JackCaoG JackCaoG requested a review from alanwaketan alanwaketan 1 year ago
jeffhataws
jeffhataws
jeffhataws requested changes on 2024-06-07
JackCaoG
JackCaoG
JackCaoG
jeffhataws
JackCaoG
JackCaoG
JackCaoG approved these changes on 2024-06-11
jeffhataws
jeffhataws
jeffhataws approved these changes on 2024-06-11
JackCaoG
amithrm Gradient bucketing using a pre-defined bucket size cap
c491f05c
amithrm Fix linter issues
3f7cc775
amithrm Added ALLREDUCE_BUCKET_SIZE_MB to turn on bucketing for allreduce
4c1233b0
amithrm Fix import
45ba8857
amithrm Fixing API for allreduce bucketized gradients
d7ecbbaa
amithrm fix linter checks
35991a69
amithrm Fixing test case
7fba20aa
jeffhataws Add bucketized all-reduce test to run_tests.sh; move test into torch_…
a7193b4a
jeffhataws Add init_method='xla://' to bucketized allreduce test
c63f00cc
jeffhataws Lint fix
f332a8fe
jeffhataws jeffhataws force pushed from b1bf93c7 to f332a8fe 1 year ago
jeffhataws Add bucket_cap_mb arguments; fix bucketized allreduce test
85c6ec17
jeffhataws jeffhataws changed the title Bucketing gradients Enable bucketized all-reduce for gradients 1 year ago
JackCaoG JackCaoG merged 28f9887b into master 1 year ago
ManfeiBai
jeffhataws
jeffhataws jeffhataws deleted the bucketing_gradients branch 1 year ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone