transformers
91df4551 - [Trainer] Make sure shown loss in distributed training is correctly averaged over all workers (#13681)

Loading