transformers
fix Dtensor and tensor mismatch for Col/RowRep
#42924
Merged

fix Dtensor and tensor mismatch for Col/RowRep #42924

ArthurZucker merged 25 commits into main from tp_replicate_interface
3outeille
3outeille begin Moe test tensor parallel
40b3e2b6
3outeille create tiny moe model + fix test tensor parallel Moe
05172a98
3outeille create tiny moe model + fix test tensor parallel Moe
d75f4b86
3outeille Merge branch 'main' into v4.57.1-test_tensor_parallel
06635f77
3outeille Merge branch 'main' into v4.57.1-test_tensor_parallel
000c33fd
3outeille fix backward pass test in tensor parallel for Dense model (#42811)
5f548ed9
3outeille Merge branch 'main' into v5-test_tensor_parallel_moe
48c69f7f
3outeille use mixtral instead for testing
87fb140d
3outeille fix dtensor and tensor mismatch
95240730
3outeille linting
ba79de05
3outeille checkout test tensor parallel to be like main
3fed52d7
3outeille Merge branch 'main' into fix_dtensor_tensor_moe_mismatch
ad0f203b
3outeille Merge branch 'main' into fix_dtensor_tensor_moe_mismatch
d6da5af8
3outeille avoid hack and create class instead
12ff9a4b
3outeille fix loading ep
b337af76
3outeille add moe test
7f19dbfb
3outeille now EP inference works again but pass still fails
d677102d
3outeille Add ColwiseParallelReplicate and RowwiseParallelReplicate classes for…
500a5673
3outeille clean
95aba6a1
3outeille Merge branch 'main' into tp_replicate_interface
515698c8
3outeille eaza
1511cfa9
3outeille Merge branch 'tp_replicate_interface' of https://github.com/huggingfa…
0e50e6ff
3outeille aeaeaea
719a0349
3outeille eaeaa
49bed72b
3outeille 3outeille requested a review from ArthurZucker ArthurZucker 4 days ago
3outeille linting
88989a6c
HuggingFaceDocBuilderDev
ArthurZucker
ArthurZucker approved these changes on 2025-12-17
ArthurZucker ArthurZucker merged 99be81e7 into main 4 days ago
ArthurZucker ArthurZucker deleted the tp_replicate_interface branch 4 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone