transformers
exclude fsdp from delay_optimizer_creation
#34140
Merged

exclude fsdp from delay_optimizer_creation #34140

eljandoubi
exclude fsdp from delay_optimizer_creation
cd0e8bb8
muellerzr
muellerzr commented on 2024-10-14
HuggingFaceDocBuilderDev
eljandoubi
add test case for trainer: FSDP mode and fp8 as mixed precision
d18e6424
rearrange imports
3344110c
ruff formatted
5055e2a2
eljandoubi
eljandoubi Merge branch 'huggingface:main' into fix_fsdp_with_fp8_in_trainer
656d7cc5
muellerzr
eljandoubi
eljandoubi Merge branch 'huggingface:main' into fix_fsdp_with_fp8_in_trainer
22cc58dd
eljandoubi Merge branch 'huggingface:main' into fix_fsdp_with_fp8_in_trainer
4827a392
adapt _init_fsdp to fp8
4a84f0f0
use _init_fsdp only when resume_from_checkpoint
2e91c5f4
eljandoubi Merge branch 'huggingface:main' into fix_fsdp_with_fp8_in_trainer
f5a3796a
In case of FDP, self.layer will be CheckpointWrapper which has no len…
af73835e
delete _init_fsdp
a2f30b0c
solve conflict
a838ba55
eljandoubi Merge branch 'huggingface:main' into fix_fsdp_with_fp8_in_trainer
cc5b4c3f
eljandoubi Merge branch 'huggingface:main' into fix_fsdp_with_fp8_in_trainer
d84336fc
eljandoubi Merge branch 'huggingface:main' into fix_fsdp_with_fp8_in_trainer
acffb63c
eljandoubi Merge branch 'huggingface:main' into fix_fsdp_with_fp8_in_trainer
78eed705
eljandoubi Merge branch 'huggingface:main' into fix_fsdp_with_fp8_in_trainer
49882f88
eljandoubi Merge branch 'huggingface:main' into fix_fsdp_with_fp8_in_trainer
9ac46640
eljandoubi Merge branch 'huggingface:main' into fix_fsdp_with_fp8_in_trainer
5acf8e05
eljandoubi Merge branch 'huggingface:main' into fix_fsdp_with_fp8_in_trainer
d7a01949
eljandoubi Merge branch 'huggingface:main' into fix_fsdp_with_fp8_in_trainer
58d18f67
fix conflict
b94376d4
eljandoubi Merge branch 'huggingface:main' into fix_fsdp_with_fp8_in_trainer
b9b9eb4f
eljandoubi
eljandoubi Merge branch 'huggingface:main' into fix_fsdp_with_fp8_in_trainer
f4665139
muellerzr
muellerzr approved these changes on 2024-10-23
muellerzr muellerzr requested a review from ArthurZucker ArthurZucker 1 year ago
muellerzr muellerzr requested a review from SunMarc SunMarc 1 year ago
eljandoubi Merge branch 'huggingface:main' into fix_fsdp_with_fp8_in_trainer
2948b297
eljandoubi Merge branch 'main' into fix_fsdp_with_fp8_in_trainer
748270da
make fixup
0ec8e587
eljandoubi Merge branch 'huggingface:main' into fix_fsdp_with_fp8_in_trainer
09df2edc
eljandoubi Merge branch 'main' into fix_fsdp_with_fp8_in_trainer
a3265d9c
eljandoubi Merge branch 'main' into fix_fsdp_with_fp8_in_trainer
33902fdf
eljandoubi
eljandoubi Merge branch 'main' into fix_fsdp_with_fp8_in_trainer
cfd81524
eljandoubi Merge branch 'main' into fix_fsdp_with_fp8_in_trainer
571e58fd
eljandoubi Merge branch 'main' into fix_fsdp_with_fp8_in_trainer
02a63c7f
ArthurZucker
ArthurZucker approved these changes on 2024-10-28
ArthurZucker ArthurZucker merged 8b3b9b48 into main 1 year ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone