flax
MultiHeadAttention only keeps rngs if dropout_rate is positive
#4750
Merged

Commits
  • MultiHeadAttention only keeps rngs if dropout_rate is positive
    a-googler committed 224 days ago
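
A minimal sketch of the behavior the title describes, assuming it means the RNG container is stored on the module only when dropout can actually use it. `AttentionSketch` and its constructor are hypothetical stand-ins written in plain Python. They are not the Flax `MultiHeadAttention` code, and the actual diff for this PR is not reproduced here.

```python
from typing import Optional


class AttentionSketch:
    """Hypothetical stand-in; NOT the Flax MultiHeadAttention implementation."""

    def __init__(self, dropout_rate: float = 0.0, rngs: Optional[object] = None):
        self.dropout_rate = dropout_rate
        # Assumption: keep the RNG container only when dropout_rate is positive,
        # since nothing consumes random numbers when dropout is disabled.
        self.rngs = rngs if dropout_rate > 0.0 else None


# With dropout disabled the rngs reference is dropped; with dropout enabled it is kept.
no_dropout = AttentionSketch(dropout_rate=0.0, rngs=object())
with_dropout = AttentionSketch(dropout_rate=0.1, rngs=object())
```

Under this assumption, a module built with `dropout_rate=0.0` carries no RNG state, which may keep unused random state out of the module's attributes.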