DeepSpeed ZeRO-3 handling when resizing embedding layers #26259
fix failing deepspeed slow tests
62ab2b87
fixes
07e2d7b7
muellerzr
approved these changes
on 2023-09-19
pacman100
changed the title from "Smangrul/ds resize emb fix" to "DeepSpeed ZeRO-3 handling when resizing embedding layers" 2 years ago
pacman100
merged
ffbf989f
into main 2 years ago
pacman100
deleted the smangrul/ds_resize_emb_fix branch 2 years ago
Assignees
No one assigned