DeepSpeed
970015bb - efficient communication layout for training MoE architecture at large scale

Commit
2 years ago
Author
Reza Yazdani