transformers
[LED] fix global_attention_mask not being passed for generation and docs clarification about grad checkpointing
#17112
Merged

[LED] fix global_attention_mask not being passed for generation and docs clarification about grad checkpointing #17112

caesar-one
caesar-one [LED] fixed global_attention_mask not passed for generation + docs cl…
411d3f48
caesar-one Merge branch 'huggingface:main' into led-generation-fix-and-documenta…
b0733457
HuggingFaceDocBuilderDev
JohnGiorgi
patrickvonplaten
patrickvonplaten approved these changes on 2022-05-09
ydshieh ydshieh requested a review from ydshieh ydshieh 4 years ago
ydshieh
ydshieh approved these changes on 2022-05-09
ydshieh ydshieh requested a review from ydshieh ydshieh 4 years ago
caesar-one
patrickvonplaten
patrickvonplaten commented on 2022-05-10
patrickvonplaten
patrickvonplaten commented on 2022-05-10
caesar-one LED docs clarification
ae2584f7
caesar-one [LED] gradient_checkpointing=True should be passed to TrainingArguments
8c8a4971
patrickvonplaten
patrickvonplaten commented on 2022-05-17
patrickvonplaten
patrickvonplaten commented on 2022-05-17
caesar-one [LED] docs: remove wrong word
49dfaa2a
caesar-one [LED] docs fix typo
fa198da3
patrickvonplaten patrickvonplaten merged d9050dc7 into main 4 years ago
patrickvonplaten

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone