vllm
[CPU] Enable shared-memory based pipeline parallel for CPU backend
#21289
Merged

[CPU] Enable shared-memory based pipeline parallel for CPU backend #21289

vllm-bot merged 12 commits into vllm-project:main from bigPYJ1151:shm_pp
bigPYJ1151
bigPYJ1151 bigPYJ1151 requested a review from hmellor hmellor 149 days ago
github-actions
mergify mergify added documentation
mergify mergify added ci/build
gemini-code-assist
gemini-code-assist commented on 2025-07-21
bigPYJ1151 shm send/recv
6990811a
bigPYJ1151 enlarge shm buffer
4df43df4
bigPYJ1151 refine kv cache size setting
e3910916
bigPYJ1151 update default batchsize
d04905ab
bigPYJ1151 fix 2.6 compile bug
a7199f74
bigPYJ1151 enable tp/pp e2e test
edcc1748
bigPYJ1151 fix
7c0d4a9e
bigPYJ1151 only run e2e
d5151f8b
bigPYJ1151 fix
016663f4
bigPYJ1151 Revert "only run e2e"
93b2d28a
bigPYJ1151 reduce time
794c2709
bigPYJ1151 bigPYJ1151 force pushed to 794c2709 149 days ago
bigPYJ1151 fix lint
f2d2fe59
bigPYJ1151
Isotr0py
Isotr0py approved these changes on 2025-07-21
Isotr0py Isotr0py enabled auto-merge (squash) 148 days ago
github-actions github-actions added ready
hmellor
hmellor commented on 2025-07-21
bigPYJ1151
vllm-bot vllm-bot merged a15a50fc into main 148 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone