transformers
feat(model parallelism): moving the labels to the same device as the logits for gpt2 and bart
#22591
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
2
Changes
View On
GitHub
Loading