transformers
89f0fda5 - Fix the gradient checkpointing bug of the llama model (#22270)

Commit
2 years ago
Fix the gradient checkpointing bug of the llama model (#22270) fix grad ckpt bug of llama
Author
Parents
Loading