transformers
170fcaa6 - Generalize decay_mask_fn to apply mask to all LayerNorm params (#18273)

Commit
3 years ago
Generalize decay_mask_fn to apply mask to all LayerNorm params (#18273) * generalize decay_mask_fn to find all layernorm params * fixup * generalising decay_mask_fn
Author
Parents
Loading