transformers
accb7204 - Add Pytorch Tensor Parallel support for Qwen2, Qwen2Moe, Starcoder2 (#35007)

Commit
1 year ago
Add Pytorch Tensor Parallel support for Qwen2, Qwen2Moe, Starcoder2 (#35007)

* add base tp plan for qwen2 and qwen2moe
* add parallel tp for starcoder2
* fix modular conversion
* add infer dim for qkv states
* Update src/transformers/models/qwen2_moe/configuration_qwen2_moe.py

---------

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
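For readers unfamiliar with the term, a "base tp plan" can be pictured as a mapping from module-name patterns to a partition style. The sketch below is illustrative only and is not taken from this commit's diff: the pattern strings, the `"colwise"`/`"rowwise"` labels, and the helper `sharding_style` are assumptions chosen to show the idea, not the library's API.

```python
from fnmatch import fnmatchcase

# Hypothetical plan (assumption, not copied from the commit): projections that
# fan out are split column-wise, projections that fan back in are split row-wise.
TP_PLAN = {
    "layers.*.self_attn.q_proj": "colwise",
    "layers.*.self_attn.k_proj": "colwise",
    "layers.*.self_attn.v_proj": "colwise",
    "layers.*.self_attn.o_proj": "rowwise",
    "layers.*.mlp.gate_proj": "colwise",
    "layers.*.mlp.up_proj": "colwise",
    "layers.*.mlp.down_proj": "rowwise",
}


def sharding_style(module_name, plan=TP_PLAN):
    """Return the partition style for a module name, or None if no pattern matches."""
    for pattern, style in plan.items():
        # "*" stands in for the layer index, e.g. "layers.7.mlp.up_proj".
        if fnmatchcase(module_name, pattern):
            return style
    return None


if __name__ == "__main__":
    for name in ("layers.0.self_attn.q_proj", "layers.3.mlp.down_proj", "layers.3.input_layernorm"):
        print(name, "->", sharding_style(name))
```

Modules that match no pattern (such as the layer norm in the usage lines) are left unsharded in this sketch. The "infer dim for qkv states" item plausibly relates to the same concern: once the heads are split across devices, the per-device head count can no longer be hard-coded when reshaping.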