transformers
66ecc2b6 - disable use_cache if using gradient checkpointing (#30320)

Commit
1 year ago
disable use_cache if using gradient checkpointing (#30320)
Author
Committer
Parents
Loading