transformers
e708bb75 - Correct TF formatting to exclude LayerNorms from weight decay (#4448)

Commit
5 years ago
Correct TF formatting to exclude LayerNorms from weight decay (#4448)

* Exclude LayerNorms from weight decay
* Include both formats of layer norm
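The commit message describes excluding layer-norm parameters from weight decay by matching both naming formats. The following is a minimal sketch of that idea, not the code from this commit: it assumes exclusion is decided by regex-matching variable names, and the pattern list, the function name `uses_weight_decay`, and the sample variable names are all hypothetical.

```python
import re

# Assumed patterns: both layer-norm naming formats, plus biases.
EXCLUDE_PATTERNS = ["LayerNorm", "layer_norm", "bias"]


def uses_weight_decay(param_name, exclude_patterns=EXCLUDE_PATTERNS):
    """Return True if weight decay should apply to the named parameter."""
    return not any(re.search(p, param_name) for p in exclude_patterns)


# Illustrative variable names, not taken from a real checkpoint.
names = [
    "encoder/layer_0/attention/dense/kernel:0",
    "encoder/layer_0/attention/dense/bias:0",
    "encoder/layer_0/LayerNorm/gamma:0",
    "encoder/layer_0/layer_norm/beta:0",
]

# Keep only the parameters that should receive weight decay.
decayed = [n for n in names if uses_weight_decay(n)]
```

With only one of the two layer-norm patterns in the list, variables named in the other format would still be decayed, which is the kind of mismatch the second bullet appears to address.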