vllm
[Feature] support sequence parallelism using compilation pass
#16155
Merged

vllm-bot merged 24 commits into vllm-project:main from cascade812:sp_pass
cascade812
cascade812 add reduce scatter op and register all gather
9f4dd679
cascade812 replace all reduce with reduce scatter and all gather
09caae61
cascade812 requested a review from DarkLight1337 1 year ago
cascade812 requested a review from ywang96 1 year ago
github-actions
cascade812 marked this pull request as draft 1 year ago
robertgshaw2-redhat
cascade812 match first embedding
84f4360c
cascade812 update embedding replace pattern
165216d8
cascade812 compile graph only for specific shapes
4318d655
cascade812 force pushed to 4318d655 1 year ago
mergify added v1
cascade812 clean code
abd29534
cascade812 marked this pull request as ready for review 1 year ago
cascade812 requested a review from WoosukKwon 1 year ago
cascade812 requested a review from robertgshaw2-redhat 1 year ago
cascade812 requested a review from njhill 1 year ago
cascade812 requested a review from comaniac 1 year ago
cascade812 requested a review from alexm-redhat 1 year ago
cascade812 add test and rename
ca7fcb15
mergify added ci/build
yaochengji requested a review from tlrmchlsmth 1 year ago
yaochengji
yaochengji commented on 2025-04-11
youkaichao
youkaichao commented on 2025-04-12
youkaichao
youkaichao commented on 2025-04-12
youkaichao
youkaichao commented on 2025-04-12
ProExpertProg
cascade812 address comments
ffb2e24f
ProExpertProg
ProExpertProg commented on 2025-04-13
yaochengji
yaochengji approved these changes on 2025-04-16
tlrmchlsmth
tlrmchlsmth commented on 2025-04-16
tlrmchlsmth
tlrmchlsmth commented on 2025-04-16
tlrmchlsmth
tlrmchlsmth commented on 2025-04-16
tlrmchlsmth
tlrmchlsmth commented on 2025-04-16
mergify
mergify added needs-rebase
tlrmchlsmth
tlrmchlsmth commented on 2025-04-17
bnellnm
bnellnm commented on 2025-04-17
tlrmchlsmth
tlrmchlsmth commented on 2025-04-17
tlrmchlsmth
tlrmchlsmth commented on 2025-04-17
bnellnm
bnellnm commented on 2025-04-17
bnellnm
bnellnm commented on 2025-04-17
bnellnm
bnellnm commented on 2025-04-17
cascade812 update
46951107
cascade812 force pushed to 46951107 1 year ago
cascade812
cascade812 pass in dtype and device
662e6988
tlrmchlsmth
tlrmchlsmth Merge branch 'main' into sp_pass
f60a8712
mergify removed needs-rebase
bnellnm
tlrmchlsmth
cascade812 enable rms_norm automatically if enable_sequence_parallelism=True
9a72e10c
ProExpertProg
cascade812 add test for sq pass
552857c8
yaochengji added ready
cascade812 fix failed tests
629e9426
cascade812 fix failed tests
1a608655
cascade812 fix failed tests
0736045f
tlrmchlsmth
tlrmchlsmth approved these changes on 2025-04-21
ProExpertProg
ProExpertProg commented on 2025-04-21
ProExpertProg
ProExpertProg commented on 2025-04-22
cascade812 address comments
534af36d
cascade812 minor fix
c16a1974
zou3519
zou3519 commented on 2025-04-22
cascade812 update test
82527a12
ProExpertProg
ProExpertProg commented on 2025-04-25
cascade812 test FixFunctionalizationPass with SequenceParallelismPass
5b12ce50
cascade812 remove redundant code
230ee3ce
cascade812 Merge remote-tracking branch 'origin' into sp_pass
57d684d8
cascade812 remove the singleton pattern to support two LLM instances.
8dc0422f
cascade812
cascade812 nit
b251ad52
tlrmchlsmth enabled auto-merge (squash) 1 year ago
vllm-bot merged 690fe019 into main 1 year ago
Juelianqvq
cascade812
Assignees
No one assigned