transformers
063b4569 - [DeepSpeed] properly handle MoE weight conversion (#43524)

Commit
29 days ago
[DeepSpeed] properly handle MoE weight conversion (#43524)

* properly handle MoE weight conversion
* fix style
* Non-expert keys bug fix
* remove dead code