DeepSpeed
Further refactor deepspeed.moe.utils + deepspeed.moe.layer type hints
#5060
Merged

ringohoffman
Further refactor split_params_into_different_moe_groups_for_optimizer
d78813de
Improve deepspeed.moe.layer type hints, style
badd7451
ringohoffman requested a review from awan-10 1 year ago
Allow max_group_size to be a float so users can pass float("inf")
0b8002e7
mrwyattii
mrwyattii approved these changes on 2024-02-02
Add missed use_rts type hint
ee41f0eb
mrwyattii
Merge branch 'microsoft:master' into further-refactor-split_params_in…
1ba1a4b2
Run formatter
e7f89cb2
mrwyattii Merge branch 'master' into further-refactor-split_params_into_differe…
6b1621ab
mrwyattii merged 9922270f into master 1 year ago
