transformers
0e2cf025
- DeepSpeed ZeRO-3 handling when resizing embedding layers (#26259)
Commit
2 years ago
DeepSpeed ZeRO-3 handling when resizing embedding layers (#26259)

* fix failing deepspeed slow tests
* fixes
Author
pacman100
Committer
LysandreJik
Parents
c9b9d87e