transformers
[Flax] Add remat (gradient checkpointing)
#17843
Merged

[Flax] Add remat (gradient checkpointing) #17843

sanchit-gandhi
sanchit-gandhi sanchit-gandhi added WIP
sanchit-gandhi
sanchit-gandhi commented on 2022-06-23
HuggingFaceDocBuilderDev
sanchit-gandhi sanchit-gandhi requested a review from patil-suraj patil-suraj 3 years ago
sanchit-gandhi sanchit-gandhi requested a review from patrickvonplaten patrickvonplaten 3 years ago
borisdayma
sanchit-gandhi
sanchit-gandhi sanchit-gandhi removed WIP
patrickvonplaten
patrickvonplaten approved these changes on 2022-06-27
borisdayma
patrickvonplaten
[Flax] Add remat (gradient checkpointing)
7dd4c58a
fix variable naming in test
03606c12
flip: checkpoint using a method
9b6e1644
fix naming
395221f8
fix class naming
7b165d53
apply PVP's suggestions from code review
d6040e01
make fix-copies
9972f381
sanchit-gandhi sanchit-gandhi force pushed from c7915740 to 9972f381 3 years ago
fix big-bird, electra, roberta
0a0b6bfa
cookie-cutter
70b7175e
fix flax big-bird
0665a336
move test to common
66c4b14d
sanchit-gandhi sanchit-gandhi merged 485bbe79 into main 3 years ago
sanchit-gandhi sanchit-gandhi deleted the flax-remat branch 3 years ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone