transformers
[Trainer] use output.loss when using liger-kernel
#42444
Merged
