vllm
[Optimization] Cache sampled token ids in model runner
#20291
Merged

[Optimization] Cache sampled token ids in model runner #20291

WoosukKwon merged 25 commits into main from woosuk/token-id
WoosukKwon
WoosukKwon Implement Async Scheduler
487efdc3
WoosukKwon Merge branch 'main' into woosuk/async-sched
a3c320d7
WoosukKwon optimization
bde64a48
WoosukKwon Flatten cached_reqs_data
e4f91493
WoosukKwon fix nccl connector
5804545c
WoosukKwon shared storage connector
5fcb42d5
WoosukKwon fix test
f06cb35a
WoosukKwon fix more tests
c849f586
WoosukKwon Merge branch 'main' into woosuk/async-sched
662a60db
WoosukKwon Merge branch 'woosuk/serial' into woosuk/async-sched
85256779
WoosukKwon Merge branch 'woosuk/async-sched' of https://github.com/vllm-project/…
388774b7
WoosukKwon fix
85384796
WoosukKwon Merge branch 'main' into woosuk/async-sched
3c783e66
Merge branch 'main' into woosuk/token-id
7ade9aac
WoosukKwon [Optimization] Cache sampled token ids in model runner
a3b9964f
WoosukKwon minor
278c9779
WoosukKwon minor
26728f53
WoosukKwon WoosukKwon requested a review from robertgshaw2-redhat robertgshaw2-redhat 316 days ago
WoosukKwon WoosukKwon requested a review from njhill njhill 316 days ago
WoosukKwon WoosukKwon requested a review from ywang96 ywang96 316 days ago
WoosukKwon WoosukKwon requested a review from comaniac comaniac 316 days ago
WoosukKwon WoosukKwon requested a review from alexm-redhat alexm-redhat 316 days ago
github-actions
WoosukKwon WoosukKwon requested a review from LucasWilkinson LucasWilkinson 316 days ago
gemini-code-assist
gemini-code-assist commented on 2025-07-01
mergify mergify added v1
gemini-code-assist
gemini-code-assist commented on 2025-07-01
WoosukKwon fix shared storage connector
20642834
WoosukKwon Merge branch 'main' into woosuk/token-id
0a0e177c
WoosukKwon yapf
ea07d183
WoosukKwon minor
e0f8c514
WoosukKwon WoosukKwon added ready
WoosukKwon fix
d577f7b0
LucasWilkinson
LucasWilkinson commented on 2025-07-01
LucasWilkinson
LucasWilkinson commented on 2025-07-01
WoosukKwon address review
c51372e1
LucasWilkinson
LucasWilkinson approved these changes on 2025-07-01
WoosukKwon Merge branch 'main' into woosuk/token-id
3441d025
WoosukKwon Fix test_gpu_model_runner
23172945
WoosukKwon WoosukKwon merged 7f280d69 into main 316 days ago
WoosukKwon WoosukKwon deleted the woosuk/token-id branch 316 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone