flax
e6379c1b - option of forcing the input of softmax to be fp32 for better numerical stability in mixed-precision training.

Commit
1 year ago
option of forcing the input of softmax to be fp32 for better numerical stability in mixed-precision training. PiperOrigin-RevId: 627815531
Author
Committer
Parents
Loading