transformers
15641892 - feat(model parallelism): moving the labels to the same device as the logits for gpt2 and bart (#22591)

Commit
3 years ago
feat(model parallelism): moving the labels to the same device as the logits for gpt2 and bart (#22591)
Author
Parents
Loading