onnxruntime
6ff31e06 - [MoE] Add TP and Mixtral MoE (#19945)

Commit
1 year ago
[MoE] Add TP and Mixtral MoE (#19945) ### Description <!-- Describe your changes. --> 1.Support Tensor Parallelism in ShardedMoE. 2.Make necessary code changes to support Mixtral MoE. 3.Fix a bug related to using IOBinding in test script. 4.Fix the input size limitation ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->
Author
Parents
Loading