DeepSpeed
3dd7ccff - enable phi3_mini autotp (#5501)

Commit
1 year ago
This PR enables AutoTP (automatic tensor parallelism) for Phi-3 mini. Phi-3 mini uses a chunked MLP, so the weight order of that linear layer is adjusted to support the model.

Co-authored-by: Lev Kurilenko <113481193+lekurile@users.noreply.github.com>
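To illustrate why the weight order matters, here is a minimal sketch rather than the code from this commit. It assumes the chunked MLP stores its gate and up projections stacked in one fused weight (gate rows first, then up rows) and splits the output in two after the matmul. The function name `shard_fused_gate_up` and the row labels are hypothetical. A plain contiguous split across ranks would hand one rank only gate rows and another only up rows, so each rank's local chunk would be wrong; reordering gives every rank a matching gate slice followed by an up slice.

```python
def shard_fused_gate_up(weight_rows, tp_size):
    """Split a fused [gate; up] weight so each rank keeps a gate slice and an up slice.

    weight_rows: list of rows, assumed to be gate rows followed by up rows.
    tp_size: number of tensor-parallel ranks (hypothetical parameter name).
    """
    total = len(weight_rows)
    assert total % 2 == 0, "fused weight must hold two equal halves"
    half = total // 2
    assert half % tp_size == 0, "each half must divide evenly across ranks"

    gate, up = weight_rows[:half], weight_rows[half:]
    per_rank = half // tp_size

    shards = []
    for rank in range(tp_size):
        lo, hi = rank * per_rank, (rank + 1) * per_rank
        # Rank-local layout is again [gate slice; up slice], so a local
        # two-way chunk of the output still separates gate from up.
        shards.append(gate[lo:hi] + up[lo:hi])
    return shards


# Labels stand in for weight rows: g* = gate, u* = up.
rows = ["g0", "g1", "g2", "g3", "u0", "u1", "u2", "u3"]

# Naive contiguous split: rank 0 gets only gate rows, rank 1 only up rows.
naive = [rows[0:4], rows[4:8]]

# Reordered split: every rank gets matching gate and up slices.
shards = shard_fused_gate_up(rows, tp_size=2)
```

With two ranks, `naive` leaves rank 0 with no up rows at all, while `shards` gives rank 0 the rows `g0, g1, u0, u1` and rank 1 the rows `g2, g3, u2, u3`.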