transformers
5a70a77b
- Add Support to Gradient Checkpointing for LongT5 (#18977)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
3 years ago
Add Support to Gradient Checkpointing for LongT5 (#18977) FlaxLongT5PreTrainedModel is missing "enable_gradient_checkpointing" function. This gives an error if someone tries to enable gradient checkpointing for longt5. This pull request fixes it.
References
#18977 - Add Support to Gradient Checkpointing for LongT5
Author
agemagician
Parents
4157e3cd
Loading