transformers
3ceb6833 - Revert "Use torch/tf rsqrt to scale attentions instead of div and math"

Commit
5 years ago
Revert "Use torch/tf rsqrt to scale attentions instead of div and math" This reverts commit 54d46729
Author
Committer
Parents
Loading