feat(model parallelism): moving the labels to the same device as the logits for gpt2 and bart #22591
feat(model parallelism): moving the labels to the same device as the …
c0d5df2a
apply make fix-copies
6efbcd5e
sgugger
approved these changes
on 2023-04-05
sgugger
merged
15641892
into main 3 years ago
kausmeows
deleted the kaus branch 3 years ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub