transformers
Always initialize tied output_embeddings if it has a bias term
#28947
Merged

Loading