transformers
41d47db9 - [Bugfix] `OPTDecoderLayer` does not return attentions when `gradient_checkpointing` and `training` is enabled. (#23367)

Commit
2 years ago
Update modeling_opt.py