transformers
8c5e29ba - Avoid unnecessary device operations in loss computing (#36950)

Commit
263 days ago
Avoid unnecessary device operations in loss computing (#36950)

* Avoid unnecessary tensor copy in loss computing
* Add type