DeepSpeed
support bf16_optimizer moe expert parallel training and moe EP grad_scale/grad_norm fix
#5259
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
20
Changes
View On
GitHub
Loading