Handle MoE models with DeepSpeed #2662
Handle MoE models with DeepSpeed
cb81cdf8
Update launch.py
ed585c0e
Update test_deepspeed.py
9ddfd2b0
Update test_deepspeed.py
55795e78
pacman100
marked this pull request as ready for review 1 year ago
muellerzr
approved these changes
on 2024-04-12
Update src/accelerate/utils/dataclasses.py
826718cc
address comments
6dd56e4c
Update deepspeed.md
be507b3d
pacman100
merged
701e24c5
into main 1 year ago
Assignees
No one assigned