Tp param level #46290

3outeille merged 34 commits into distributed from tp_param_level
3outeille
HuggingFaceDocBuilderDev
3outeille [distributed] Add param-level MoE TP/EP styles and ep_router
f20c14c1
3outeille [distributed] Add param-level apply pass to apply_tensor_parallel
bcecab89
3outeille [distributed] Add MoE TP/EP plan tests and a two-sided sharding asser…
963777a0
3outeille [distributed] Migrate MoE configs to decomposed TP/SP expert plans
6a6dda75
3outeille [distributed] Document decomposed MoE TP/EP plans
6d73f841
3outeille [distributed] Rename MoE intra-expert TP styles to moe_tp_*
ec2212bd
3outeille 3outeille force pushed from 23264318 to ec2212bd 27 days ago
3outeille handle sparse and dense sp plan for qwen3_moe
654fa22a
3outeille better tests coverage for sp & ep
57b5a6e3
3outeille linting
e72ee624
3outeille uniformize TP Api to avoid confusion with torch native ops
2ff38219
3outeille inline tp
f5493685
3outeille rename
40897616
3outeille cleaning
18bccfaf
3outeille inline
5da75494
3outeille cleaning
a08c6acf
3outeille cleaning
b78f2347
3outeille linting
45925992
3outeille fix ci ep_backward
229235d8
3outeille linting
8cdcd889
3outeille remove flag expert parallel
6d0e153b
3outeille fix
df59a892
3outeille 3outeille marked this pull request as ready for review 24 days ago
3outeille 3outeille requested a review from ArthurZucker ArthurZucker 24 days ago
3outeille add tp plan + ep_plan
9add3298
3outeille revert doc
4bc3980b
3outeille fix install_forward
8393d5a0
3outeille linting
36a3c379
3outeille add moe identity back
24d042e0
3outeille no need aymore
69e118d8
3outeille update tp_plan for ernie4_5_vl_moe
b5d0bbab
3outeille sp + ep training / tp + ep inference (#46292)
d736cb69
3outeille Merge branch 'distributed' into tp_param_level
45935922
3outeille Merge branch 'distributed' into tp_param_level
d47f10a1
3outeille fix merge conflicts
79f20e90
3outeille linting
82d06ba4
3outeille Merge branch 'distributed' into tp_param_level
f005633b
github-actions
github-actions
3outeille 3outeille merged 30972993 into distributed 9 days ago
3outeille 3outeille deleted the tp_param_level branch 9 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone