flax
65dfd465 - MultiHeadAttention only keeps rngs if dropout_rate is positive

Commit
223 days ago
MultiHeadAttention only keeps rngs if dropout_rate is positive PiperOrigin-RevId: 759468680
Author
Cristian Garcia
Committer
Parents
Loading