feat: Distributed data parallel training support #2464
askorupka
force pushed
from
fb7f9fe8
to
0eabc2ac
1 year ago
askorupka
force pushed
from
e529d6f1
to
71ae53d9
1 year ago
askorupka
force pushed
from
5997a831
to
71ae53d9
1 year ago
first experiment distributed
0393894e
feat: add DistributedUtils (MPI&NCCL working)
76ae0258
feat: add DistributedUtils (MPI&NCCL working)
181cc9c4
fix: no need for amdgpu now
40bf1884
chore: cleanup&propose how to use amdgpu
450f62c2
chore: add preferences for CUDA-awareness
8fbde8d9
feat: fix devices for CUDA-awareness
599f506a
chore: add tests
3382010c
askorupka
force pushed
from
2a100505
to
3382010c
1 year ago
chore: get rid of unnecessary deps
443875e9
askorupka
force pushed
from
a4bad491
to
443875e9
1 year ago
chore: update NEWS.md
330b20bc
CarloLucibello
marked this pull request as ready for review 1 year ago
chore: cleanup env
a255ff90
chore: update docs
3aab47dc
chore: update docs & cleanup
2f54c880
chore: update docs & cleanup
8a984bb9
Update docs/src/guide/gpu.md
c0aefb75
Update docs/src/guide/gpu.md
bd23dd35
Update docs/src/guide/gpu.md
cee91506
Update docs/src/guide/gpu.md
5c85fe8b
Update docs/src/guide/gpu.md
e151eade
Update docs/src/guide/gpu.md
2797924e
Update docs/src/guide/gpu.md
f2cedd52
Update docs/src/guide/gpu.md
a3b62cbb
Update docs/src/guide/gpu.md
a144ccf1
Update docs/src/guide/gpu.md
22b35a01
Update docs/src/guide/gpu.md
7d03ef72
chore: add PR review suggestions
6c11e3c9
Merge branch 'master' into distributed
6a6951a6
chore: fix docs
41acd3fa
fix: add runtests.jl
0e33cfad
chore: small docs update
58ae10f0
chore: remove pkgs from deps
053dcc7d
mcabbott
deleted the distributed branch 1 year ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub