transformers
ffbf989f
- DeepSpeed ZeRO-3 handling when resizing embedding layers (#26259)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 years ago
DeepSpeed ZeRO-3 handling when resizing embedding layers (#26259) * fix failing deepspeed slow tests * fixes
References
#26259 - DeepSpeed ZeRO-3 handling when resizing embedding layers
Author
pacman100
Parents
39df4eca
Loading