transformers
6200fd7b
- [Gradient checkpointing] Enable for Deberta + DebertaV2 + SEW-D (#14175)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
4 years ago
[Gradient checkpointing] Enable for Deberta + DebertaV2 + SEW-D (#14175) * up * up * finish * up * final changes
References
#14175 - [Gradient checkpointing] Enable for Deberta + DebertaV2 + SEW-D
Author
patrickvonplaten
Parents
e1dc5afd
Loading