transformers
cd73b9a8 - Update: ignore padding support for TransfoXL training when n_clusters==0 (#22457)

Commit
3 years ago
Update: ignore padding support for TransfoXL training when n_clusters==0 (#22457) * Update: ignore padding support for TransfoXL training when n_clusters==0 * Update: transformer XL always pad * Update: drop doc
Author
Parents
Loading