transformers
DeepSpeed ZeRO-3 handling when resizing embedding layers
#26259
Merged

pacman100 merged 2 commits into main from smangrul/ds_resize_emb_fix
pacman100 fix failing deepspeed slow tests (62ab2b87)
pacman100 fixes (07e2d7b7)
pacman100 requested a review from muellerzr 2 years ago
pacman100 requested a review from ArthurZucker 2 years ago
HuggingFaceDocBuilderDev
muellerzr approved these changes on 2023-09-19
pacman100 changed the title from "Smangrul/ds resize emb fix" to "DeepSpeed ZeRO-3 handling when resizing embedding layers" 2 years ago
ArthurZucker approved these changes on 2023-09-19
pacman100 merged ffbf989f into main 2 years ago
pacman100 deleted the smangrul/ds_resize_emb_fix branch 2 years ago
