transformers
Improve TF weight loading, especially PT crossloading
#21792
Merged

Improve TF weight loading, especially PT crossloading #21792

Rocketknight1 merged 24 commits into main from tf_weight_loading_fixes
Rocketknight1
HuggingFaceDocBuilderDev
Rocketknight1 Rocketknight1 force pushed to db791924 3 years ago
Rocketknight1 Rocketknight1 marked this pull request as ready for review 3 years ago
Rocketknight1 Rocketknight1 requested a review from ydshieh ydshieh 3 years ago
Rocketknight1 Rocketknight1 requested a review from sgugger sgugger 3 years ago
Rocketknight1 Rocketknight1 requested a review from gante gante 3 years ago
Rocketknight1 First commit for the improved PT-TF weight loading
f9cee34f
Rocketknight1 Remove workarounds from TFEncoderDecoder tests
9d18cea5
Rocketknight1 Allow a custom weight renaming function in from_pretrained and use th…
cfde5786
Rocketknight1 make fixup
cef740d5
Rocketknight1 First attempt at visionencoderdecoder
84bbacb6
Rocketknight1 Disable tensorfloat32 in tests to get consistent outputs
bc020f8d
Rocketknight1 Quick fix to tf_vision_encoder_decoder tests
8bdcd36d
Rocketknight1 make fixup
1c5d7cf9
Rocketknight1 Update Blenderbot tests
44db428c
Rocketknight1 Remove unused arg in modeling_tf_opt
edea418e
Rocketknight1 load_tf_sharded_weights had strict=True! This meant transfer learning…
c7e3d797
Rocketknight1 Support prefixes when loading sharded TF checkpoints
6b352e1e
Rocketknight1 make fixup
f66de506
Rocketknight1 Rocketknight1 force pushed to 6b352e1e 3 years ago
Rocketknight1 Rocketknight1 requested a review from LysandreJik LysandreJik 3 years ago
Rocketknight1
Rocketknight1
Rocketknight1 commented on 2023-02-28
Rocketknight1 Add test to load sharded models with a weight prefix
d7c17700
Rocketknight1 Fix sharded weight loading test
309958c8
Rocketknight1 Add a test for transfer from a sharded checkpoint
0f699914
Rocketknight1 make fixup
2fd46b41
Rocketknight1 Add test to check that crossloading from PT with a prefix works
00da6cea
Rocketknight1 Rocketknight1 removed review request from LysandreJik LysandreJik 3 years ago
Rocketknight1 Refactor from_pretrained in the encoderdecoder classes
27631203
Rocketknight1 Refactor from_pretrained in the encoderdecoder classes
9c26cfdf
gante
gante approved these changes on 2023-02-28
Rocketknight1 missmatched -> mismatched
faf91fc4
sgugger
sgugger approved these changes on 2023-02-28
Rocketknight1 Explicitly check for None
58cab95e
Rocketknight1 No comments showing my very impressive and attractive knowledge of Py…
dda10e47
Rocketknight1 Disable TF32 across all TF tests
893d669e
Rocketknight1 Rocketknight1 merged acfb714b into main 3 years ago
Rocketknight1 Rocketknight1 deleted the tf_weight_loading_fixes branch 3 years ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone