Refactor core_model_loading to support FSDP shard-on-read loading #44974
3outeille force-pushed from e5fc7eb3 to 4b2a9216, 93 days ago
3outeille force-pushed from d48fcc7e to 607cc114, 91 days ago
3outeille force-pushed the fsdp-vs-ddp branch from 978ac872 to a5c25548, 73 days ago
DistributedConfig + shard-on-read loading (739332cd)
3outeille force-pushed from 607cc114 to 739332cd, 73 days ago
3outeille force-pushed the fsdp-vs-ddp branch from 864e9fa4 to 7f6cd3d8, 72 days ago
3outeille force-pushed from dbc96197 to c5672400, 72 days ago
3outeille force-pushed the fsdp-vs-ddp branch from 7f6cd3d8 to 37dcc14d, 72 days ago
Merge branch 'fsdp-vs-ddp' into fsdp-core-model-loading (c1dab9eb)
3outeille force-pushed from c5672400 to c1dab9eb, 72 days ago
Fix ruff formatting in core_model_loading.py (21f05610)
Assignees
No one assigned