transformers
XLA train step fixes
#17973
Merged

XLA train step fixes #17973

Rocketknight1 merged 21 commits into main from xla_train_step_fixes
Rocketknight1
Rocketknight1 Copy inputs to train and test step before modifying them, as this bre…
b924b407
Rocketknight1 Add XLA tests, fix our loss functions to be XLA-compatible
0414cedc
Rocketknight1 Rocketknight1 requested a review from gante gante 3 years ago
Rocketknight1 Rocketknight1 requested a review from LysandreJik LysandreJik 3 years ago
Rocketknight1 Rocketknight1 requested a review from sgugger sgugger 3 years ago
Rocketknight1 make fixup
167fd324
HuggingFaceDocBuilderDev
LysandreJik
LysandreJik LysandreJik requested a review from ydshieh ydshieh 3 years ago
ydshieh
ydshieh commented on 2022-07-01
gante
gante commented on 2022-07-01
Rocketknight1 Update loss computation test to expect vector of per-sample losses
e01286d3
Rocketknight1 Patch loss for TFLED
3e537933
Rocketknight1 Patch loss for TFAlbert
43ce3f58
sgugger
sgugger commented on 2022-07-01
Rocketknight1 Add a tf_legacy_loss config flag that enables old loss functions
4060777a
ydshieh
sgugger
sgugger commented on 2022-07-01
Rocketknight1 Stop using config.get() because it's not a dict
391b050d
Rocketknight1 Skip loss computation test for RAG because its loss is very strange a…
8035a272
Rocketknight1 make fixup
58e3db87
sgugger
sgugger approved these changes on 2022-07-01
Rocketknight1 Add XLA-compatible RAG loss
3b9fe743
Rocketknight1 Fix dtype of loss mask for TFAlbert
db79798f
Rocketknight1 Fix test for XLNet too because it overrides the default one
9a6b7b58
Rocketknight1 make fixup
92a4e798
Rocketknight1 Fix config test
6021439b
Rocketknight1
Rocketknight1 No more depending on GPU NaN behaviour
a46da255
Rocketknight1 Add test, avoid potential zero division
64c0e77e
Rocketknight1 Fix test item assignment
d34a3b2f
Rocketknight1 Fix loss computation masking test
32078b24
Rocketknight1 make fixup
a19ee4fd
Rocketknight1 Fix dtype bugs
f17136c8
Rocketknight1 Rocketknight1 merged d6cec458 into main 3 years ago
Rocketknight1 Rocketknight1 deleted the xla_train_step_fixes branch 3 years ago
ydshieh
Rocketknight1
patrickvonplaten
patrickvonplaten commented on 2022-07-04
Rocketknight1
patrickvonplaten
patrickvonplaten commented on 2022-07-04

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone