transformers
Refactor core_model_loading to support FSDP shard-on-read loading
#44974
Open

3outeille wants to merge 6 commits into fsdp-vs-ddp from fsdp-core-model-loading
3outeille
3outeille Refactor core_model_loading to support FSDP shard-on-read loading
4b2a9216
3outeille 3outeille force pushed from e5fc7eb3 to 4b2a9216 8 days ago
3outeille Merge fsdp-vs-ddp into fsdp-core-model-loading
c9f30d9f
3outeille Merge fsdp-vs-ddp (ruff fixes) into fsdp-core-model-loading
b7c3dc5c
3outeille linting
dfdfbdd9
3outeille from_pretrained distributed refactor (FSDP2 + TP) (#44996)
187ee5d4
3outeille DTensor-based TP + FSDP2 shard-on-read composability
607cc114
3outeille 3outeille force pushed from d48fcc7e to 607cc114 6 days ago

Reviewers
No reviews
Assignees
No one assigned