transformers
3ee431dd - [Bart/Memory] Two separate, smaller decoder attention masks (#3371)

Commit
5 years ago
[Bart/Memory] Two separate, smaller decoder attention masks (#3371)
Author
Parents
Loading