Llama et al. / FSDP : Fix breaking change in 4.40 for FSDP #31161
fix llama fsdp
a2ba7053
fixup
51881061
Merge remote-tracking branch 'origin/main' into HEAD
51e05137
adding FSDP tests for CPU offloading
91c7b0d3
fixes
d04ae5bb
fix tests
ac7977cd
fix tests
8e65359e
add it for mixtral
705afe43
propagate the changes on other models
3ce51785
Merge branch 'main' into fix-llama-fsdp
bb4c47a4
Update src/transformers/models/phi/modeling_phi.py
bd3fe928
Delete utils/testing_scripts/fsdp_cpu_offloading.py
35981b67
Delete utils/testing_scripts/dummy_fsdp_config.yml
b5e31457
Update + add cache_positions docstring
4c9831ef
amyeroberts
deleted the fix-llama-fsdp branch 1 year ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub