transformers
FSDP orchestration: apply + loading/saving
#46990
Open

FSDP orchestration: apply + loading/saving #46990

3outeille
3outeille Add FSDP orchestration: mesh init, distribute-before-load, and DCP save.
7c113940
3outeille Merge branch 'split/a-pr-3-dual-path-loading' into split/a-pr-4-fsdp-…
17c6d402
3outeille add fsdp plan to 2 models for now
00eb1166
3outeille add tests fsdp mixin
be296ddf
3outeille linting
05900d60
HuggingFaceDocBuilderDev
3outeille refactor test fsdp mixin
fc2423bd
3outeille test fsdp mixin cleaning
5bbd8201
3outeille remove fsdp policy in tests + trim down further
b6d0b67a
3outeille test fsdp clean
ea361231
3outeille restore test_modeling_utils
bec4d23a
3outeille linting
8d3d3298
3outeille start trim down stuff
6316ee1e
3outeille fix
6e9004ec
3outeille 3outeille marked this pull request as draft 1 day ago
3outeille breaking: cleaning modeling_utils.py
68df491a
3outeille load path with fsdp (dtensor) and tp (old tp) is linked
16b0b291
3outeille linting
e976a44c
3outeille add saving
a2fb155f
3outeille styling
5f52f196
3outeille fix tp ci
7f543017
3outeille add fsdp to ci
99f79acc
3outeille linting
b4514906
3outeille pick one model only for this PR
54ff4d1d
3outeille restore
33995399
3outeille trigger fsdp ci
11cf79c2
3outeille doc cleaning + tp_size remove
5b7ac3e5
3outeille fix tp ci for ep
06b0c394
3outeille edit doc
7a94d77a
3outeille 3outeille changed the title FSDP orchestration: mesh init, distribute-before-load, DCP save FSDP orchestration: apply + loading/saving 2 hours ago
3outeille move distributed function to utils + guarding
f3c742b7
3outeille linting
37df13ee
3outeille 3outeille marked this pull request as ready for review 1 hour ago
3outeille
github-actions
3outeille Merge branch 'split/a-pr-3-dual-path-loading' into split/a-pr-4-fsdp-…
86875d22
github-actions
github-actions
3outeille Merge branch 'split/a-pr-3-dual-path-loading' into split/a-pr-4-fsdp-…
450579ba
github-actions
3outeille 3outeille requested a review from ArthurZucker ArthurZucker 27 minutes ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone