transformers
[DeepSpeed] properly handle MoE weight conversion
#43524
Merged

kashif
kashif properly handle MoE weight conversion
d31144c0
kashif fix style
0b414b5c
HuggingFaceDocBuilderDev
kashif Non-expert keys bug fix
dc86097e
kashif Merge branch 'main' into moe_weight_load_Fix
25064c61
kashif Merge branch 'main' into moe_weight_load_Fix
9b924f3f
ArthurZucker commented on 2026-01-28
kashif remove dead code
5ea24d7b
kashif Merge branch 'main' into moe_weight_load_Fix
efc0ec11
ArthurZucker approved these changes on 2026-02-02
ArthurZucker merged 063b4569 into main 40 days ago
kashif deleted the moe_weight_load_Fix branch 40 days ago
jiosephlee
kashif
jiosephlee
