transformers
4eb17b26 - Drop inplace operation for loss computation with gradient accumulation (#35416)

Commit
1 year ago
Fix inplace loss computation