transformers
3ee431dd
- [Bart/Memory] Two separate, smaller decoder attention masks (#3371)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
6 years ago
[Bart/Memory] Two separate, smaller decoder attention masks (#3371)
Author
sshleifer
Parents
53fe7338
Loading