vllm
[DP] Internal Load Balancing Per Node [`one-pod-per-node`]
#21238
Merged

[DP] Internal Load Balancing Per Node [`one-pod-per-node`] #21238

robertgshaw2-redhat
added debug logging
14f13ed6
updated
b90d3316
github-actions
updated
aefeeed6
mergify mergify added frontend
mergify mergify added v1
gemini-code-assist
gemini-code-assist commented on 2025-07-20
updated
59a95836
updated
48cf09be
updated
2fd05875
updated
14cf3c47
updated
4f5d3eab
updated
14db6606
updated
2aa49757
cleanup
b1425713
updated
e1843b7e
updated
d2d54e9c
fix lb issues
4438796b
updated
2a68433a
updatedd
1ced153e
nits
b9c0f658
nits
dbc51d6e
robertgshaw2-redhat robertgshaw2-redhat requested a review from tlrmchlsmth tlrmchlsmth 163 days ago
robertgshaw2-redhat robertgshaw2-redhat requested a review from njhill njhill 163 days ago
updated
471fa4ae
njhill
stash
6569facd
stash
1e5303a8
convert to use only one prometheus stat logger per async llm
a69edca3
convert to use only one prometheus stat logger per async llm
de91a3cd
cleanup prometheus logging
e08e1e99
updated
d39cf938
updated
9a2e26d0
tlrmchlsmth
njhill
updated
3956d8cc
updated
cad96705
updated
fd0650f2
updated
896b0a27
robertgshaw2-redhat
updated
54e405bd
updated
02ecfa80
updated
1358836f
updated
4eae5cbe
robertgshaw2-redhat Merge pull request #19 from robertgshaw2-redhat/fix-prometheus-logging
5e6114df
updated
c08fb6d4
cleanup
d9291f99
updated
876c864d
updated
f477b504
updated
5ea4fa20
cleanup
e9e180da
updated
3f4ae353
updated
840d3812
Merge branch 'main' into one-pod-per-node-lb
1b488f8d
revert logger changes
e540aa41
nit comments
72d2c87f
nit comments
6206a06a
refactor ux
c2299047
refactor ux
d9ea345d
updated
d4ab18f1
updated
2cf8ff64
updated
99583c2a
updated
ad34f4a9
updated
fc79d23d
cleanup
6491d592
updated
7127d838
updated
cae7cb02
updated
093b9380
debug
0018dd01
updated
a46bc0a6
seems to be working again, but LB is wrong
85cd2da6
stash
9f7d3217
updated
91608889
stash
be03d841
stash
32a35f5d
updated
fe68027a
stash
2d32c284
cleanup
d327a6be
updated
ec86e797
updated
3c206b19
updated
a5889288
updated
e81c277e
updated
1dcd9006
cleanup
5f0663bc
update ux
6feb4569
update ux
f53166a9
robertgshaw2-redhat robertgshaw2-redhat marked this pull request as ready for review 162 days ago
robertgshaw2-redhat robertgshaw2-redhat requested a review from WoosukKwon WoosukKwon 162 days ago
robertgshaw2-redhat robertgshaw2-redhat requested a review from ywang96 ywang96 162 days ago
robertgshaw2-redhat robertgshaw2-redhat requested a review from comaniac comaniac 162 days ago
robertgshaw2-redhat robertgshaw2-redhat requested a review from alexm-redhat alexm-redhat 162 days ago
robertgshaw2-redhat robertgshaw2-redhat requested a review from simon-mo simon-mo 162 days ago
robertgshaw2-redhat robertgshaw2-redhat requested a review from youkaichao youkaichao 162 days ago
robertgshaw2-redhat robertgshaw2-redhat requested a review from mgoin mgoin 162 days ago
robertgshaw2-redhat robertgshaw2-redhat requested a review from houseroad houseroad 162 days ago
robertgshaw2-redhat robertgshaw2-redhat requested a review from hmellor hmellor 162 days ago
robertgshaw2-redhat robertgshaw2-redhat requested a review from aarnphm aarnphm 162 days ago
updated
e80c015d
updated
1b481d34
finished validating
40397e37
njhill
njhill commented on 2025-07-21
robertgshaw2-redhat Update vllm/engine/arg_utils.py
58e4227f
robertgshaw2-redhat
njhill
tlrmchlsmth
tlrmchlsmth commented on 2025-07-21
njhill fix data_parallel_hybrid_lb arg default value
7a793ad5
simon-mo simon-mo added this to the v0.10.0 milestone 162 days ago
mergify
mergify mergify added needs-rebase
njhill Merge remote-tracking branch 'origin/main' into one-pod-per-node-lb
60ae2239
mergify mergify removed needs-rebase
njhill fix coordinator for hybrid LB mode
36ed9f34
njhill infer hybrid lb mode on secondary modes
82f9292b
njhill add cross-node dp arg validation
f27a85d4
njhill fix assert
cecf38ae
njhill fix handshake
75bd8ead
njhill njhill added ready
njhill fix cross-node headless arg validation
aca3ce6b
njhill
njhill approved these changes on 2025-07-22
njhill fix handshake mock test
f63cc192
njhill Merge remote-tracking branch 'refs/remotes/origin/main' into one-pod-…
1bd5f2f1
njhill fix bad merge
8601a22d
njhill [Tests] Add tests for headless internal DP LB
d95aedd5
njhill CI tests for hybrid DPLB mode
6328c808
mergify mergify added ci/build
njhill fix internal_dp_lb tests
1c300fcf
njhill rename test
fb0cf7e2
mergify
mergify mergify added needs-rebase
njhill Merge remote-tracking branch 'origin/main' into one-pod-per-node-lb
5fb68091
mergify mergify removed needs-rebase
njhill
tlrmchlsmth
tlrmchlsmth commented on 2025-07-23
tlrmchlsmth relax hybrid dp asserts
35f3782d
simon-mo simon-mo merged d5b981f8 into main 159 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone