vllm
[CPU] Enable shared-memory based pipeline parallel for CPU backend
#21289
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
12
Changes
View On
GitHub
[CPU] Enable shared-memory based pipeline parallel for CPU backend
#21289
vllm-bot
merged 12 commits into
vllm-project:main
from
bigPYJ1151:shm_pp
bigPYJ1151
requested a review
from
hmellor
149 days ago
mergify
added
documentation
mergify
added
ci/build
gemini-code-assist
commented on 2025-07-21
shm send/recv
6990811a
enlarge shm buffer
4df43df4
refine kv cache size setting
e3910916
update default batchsize
d04905ab
fix 2.6 compile bug
a7199f74
enable tp/pp e2e test
edcc1748
fix
7c0d4a9e
only run e2e
d5151f8b
fix
016663f4
Revert "only run e2e"
93b2d28a
reduce time
794c2709
bigPYJ1151
force pushed
to
794c2709
149 days ago
fix lint
f2d2fe59
Isotr0py
approved these changes on 2025-07-21
Isotr0py
enabled auto-merge (squash)
148 days ago
github-actions
added
ready
hmellor
commented on 2025-07-21
vllm-bot
merged
a15a50fc
into main
148 days ago
Login to write a write a comment.
Login via GitHub
Reviewers
Isotr0py
hmellor
gemini-code-assist
Assignees
No one assigned
Labels
documentation
ready
ci/build
Milestone
No milestone
Login to write a write a comment.
Login via GitHub