transformers
b1a2fba1 - fix `Dtensor` and `tensor` mismatch (#42906)

Commit
6 days ago
fix `Dtensor` and `tensor` mismatch (#42906) * begin Moe test tensor parallel * create tiny moe model + fix test tensor parallel Moe eaeaae * create tiny moe model + fix test tensor parallel Moe eaeaae fix tensor parallel MoE test fix tensor parallel MoE test * fix backward pass test in tensor parallel for Dense model (#42811) * fix * linting * use mixtral instead for testing * fix dtensor and tensor mismatch * linting * checkout test tensor parallel to be like main * avoid hack and create class instead
Author
Parents
Loading