transformers
fix deepspeed load best model at end when the model gets sharded
#25057
Merged

fix deepspeed load best model at end when the model gets sharded #25057

pacman100
pacman100 fix deepspeed load best model at end when the model gets sharded
aa6ea05e
HuggingFaceDocBuilderDev
pacman100 pacman100 marked this pull request as ready for review 2 years ago
pacman100 pacman100 requested a review from sgugger sgugger 2 years ago
sgugger
sgugger approved these changes on 2023-07-27
pacman100 pacman100 merged a0042379 into main 2 years ago
pacman100 pacman100 deleted the smangrul/deepspeed-load-best-model branch 2 years ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone