vllm
[Perf] Optimize `sampled_token_ids` using numpy and remove `tolist`, 0.9% E2E throughput improvement
#35446
Open
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
3
Changes
View On
GitHub
[Perf] Optimize `sampled_token_ids` using numpy and remove `tolist`, 0.9% E2E throughput improvement
#35446
yewentao256
wants to merge 3 commits into
main
from
wentao-optimize-sampled-token-ids
optimize sampled_token_ids using numpy and remove tolist
c7201f5b
yewentao256
requested a review
from
WoosukKwon
4 days ago
yewentao256
requested a review
from
robertgshaw2-redhat
4 days ago
yewentao256
requested a review
from
njhill
4 days ago
yewentao256
requested a review
from
ywang96
4 days ago
yewentao256
requested a review
from
alexm-redhat
4 days ago
yewentao256
requested a review
from
heheda12345
4 days ago
yewentao256
requested a review
from
ApostaC
4 days ago
yewentao256
requested a review
from
orozery
4 days ago
mergify
added
v1
yewentao256
added
ready
gemini-code-assist
commented on 2026-02-26
fix
7e73d268
fix pre-commit
c1879c1f
Login to write a write a comment.
Login via GitHub
Reviewers
gemini-code-assist
WoosukKwon
robertgshaw2-redhat
njhill
ywang96
alexm-redhat
heheda12345
ApostaC
orozery
Assignees
No one assigned
Labels
ready
v1
Milestone
No milestone
Login to write a write a comment.
Login via GitHub