fix Dtensor and tensor mismatch for Col/RowRep #42924
begin Moe test tensor parallel
40b3e2b6
create tiny moe model + fix test tensor parallel Moe
05172a98
create tiny moe model + fix test tensor parallel Moe
d75f4b86
Merge branch 'main' into v4.57.1-test_tensor_parallel
06635f77
Merge branch 'main' into v4.57.1-test_tensor_parallel
000c33fd
fix backward pass test in tensor parallel for Dense model (#42811)
5f548ed9
Merge branch 'main' into v5-test_tensor_parallel_moe
48c69f7f
use mixtral instead for testing
87fb140d
fix dtensor and tensor mismatch
95240730
linting
ba79de05
checkout test tensor parallel to be like main
3fed52d7
Merge branch 'main' into fix_dtensor_tensor_moe_mismatch
ad0f203b
Merge branch 'main' into fix_dtensor_tensor_moe_mismatch
d6da5af8
avoid hack and create class instead
12ff9a4b
fix loading ep
b337af76
add moe test
7f19dbfb
now EP inference works again but pass still fails
d677102d
Add ColwiseParallelReplicate and RowwiseParallelReplicate classes for…
500a5673
clean
95aba6a1
Merge branch 'main' into tp_replicate_interface
515698c8
eaza
1511cfa9
Merge branch 'tp_replicate_interface' of https://github.com/huggingfa…
0e50e6ff
aeaeaea
719a0349
eaeaa
49bed72b
linting
88989a6c
ArthurZucker
deleted the tp_replicate_interface branch 4 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub