vllm
[EPLB] Reduce EPLB Inference Overhead
#24573
Merged

tlrmchlsmth merged 6 commits into vllm-project:main from abmfy:eplb-compile
abmfy
abmfy [Feature] Compile expert replica selection
56eafd95
abmfy [Feature] Use module pseudo-random in replica selection
f0cc507d
abmfy Merge branch 'main' into eplb-compile
5baf3392
gemini-code-assist commented on 2025-09-10
abmfy [Style] Add `indices_type` guard
1db8a3cd
robertgshaw2-redhat requested a review from ProExpertProg 162 days ago
mgoin
abmfy [Style] Fix lint errors
db7ffbef
robertgshaw2-redhat added the eplb label
tlrmchlsmth added the ready label
tlrmchlsmth approved these changes on 2025-09-22
tlrmchlsmth Merge branch 'main' into eplb-compile
be310d00
tlrmchlsmth requested a review from mgoin 150 days ago
tlrmchlsmth added this to the v0.10.3 milestone 150 days ago
tlrmchlsmth enabled auto-merge (squash) 150 days ago
tlrmchlsmth merged 06a41334 into main 150 days ago
