DeepSpeed
8d98e171 - Enable mixtral 8x7b autotp (#5257)

Commit
1 year ago
Enable mixtral 8x7b autotp (#5257) This PR aims to enable mixtral 8x7b (MoE model) autotp. Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>
Author
Parents
Loading