transformers
Fix the gradient checkpointing bug of the llama model
#22270
Merged

Fix the gradient checkpointing bug of the llama model #22270

sgugger merged 1 commit into huggingface:main from main
yqy2001
yqy2001 fix grad ckpt bug of llama
fe32d793
sgugger
sgugger approved these changes on 2023-03-20
HuggingFaceDocBuilderDev
ArthurZucker
ArthurZucker approved these changes on 2023-03-20
sgugger sgugger merged 89f0fda5 into main 2 years ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone