vllm
[Perf] Avoid pageable HtoD transfer in MinTokensLogitsProcessor
#29826
Merged

[Perf] Avoid pageable HtoD transfer in MinTokensLogitsProcessor #29826

jthomson04
jthomson04 jthomson04 requested a review from 22quinn 22quinn 31 days ago
jthomson04 jthomson04 requested a review from houseroad houseroad 31 days ago
jthomson04 jthomson04 requested a review from njhill njhill 31 days ago
mergify mergify added v1
gemini-code-assist
gemini-code-assist commented on 2025-12-02
chatgpt-codex-connector
chatgpt-codex-connector commented on 2025-12-02
njhill
njhill commented on 2025-12-02
njhill njhill added this to the v0.12.0 milestone 31 days ago
nvpohanh
nvpohanh commented on 2025-12-02
jthomson04 Initial implementation
61ae2add
jthomson04 jthomson04 force pushed from a5349a67 to 61ae2add 31 days ago
jthomson04
jthomson04 jthomson04 requested a review from njhill njhill 31 days ago
jthomson04 jthomson04 requested a review from nvpohanh nvpohanh 31 days ago
nvpohanh
nvpohanh approved these changes on 2025-12-02
njhill
njhill approved these changes on 2025-12-02
njhill njhill added ready
njhill Merge branch 'main' into jthomson04/fix-async-scheduler-with-min-tokens
384a59e2
njhill njhill enabled auto-merge (squash) 30 days ago
khluu Merge branch 'main' into jthomson04/fix-async-scheduler-with-min-tokens
2f631533
DarkLight1337
njhill
jthomson04 Merge branch 'main' into jthomson04/fix-async-scheduler-with-min-tokens
85654423
njhill fix in-place sliced assignment
0b0798bc
njhill
njhill Merge branch 'main' into jthomson04/fix-async-scheduler-with-min-tokens
2ae4af33
njhill njhill merged 1528e079 into main 30 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone