DeepSpeed
MoE inference + PR-MoE model support
#1705
Merged

Loading