vllm
[Core] Support `min_tokens` with speculative decoding
#32642
Merged

[Core] Support `min_tokens` with speculative decoding #32642

qianlihuang
mergify mergify added v1
gemini-code-assist
gemini-code-assist commented on 2026-01-20
qianlihuang qianlihuang changed the title Feat: add sampling (min_tokens,...) support for speculative decoding [Core] Support `min_tokens` with speculative decoding 38 days ago
qianlihuang qianlihuang marked this pull request as ready for review 38 days ago
qianlihuang qianlihuang requested a review from 22quinn 22quinn 38 days ago
qianlihuang qianlihuang requested a review from houseroad houseroad 38 days ago
qianlihuang qianlihuang requested a review from njhill njhill 38 days ago
qianlihuang qianlihuang requested a review from copilot-pull-request-reviewer copilot-pull-request-reviewer 38 days ago
mergify
copilot-pull-request-reviewer
copilot-pull-request-reviewer commented on 2026-01-21
mergify
qianlihuang qianlihuang force pushed 38 days ago
qianlihuang qianlihuang force pushed 38 days ago
qianlihuang qianlihuang force pushed 38 days ago
qianlihuang qianlihuang force pushed 36 days ago
qianlihuang qianlihuang force pushed 36 days ago
qianlihuang
benchislett
benchislett commented on 2026-02-01
njhill
qianlihuang qianlihuang marked this pull request as draft 26 days ago
qianlihuang qianlihuang force pushed 25 days ago
qianlihuang qianlihuang force pushed 25 days ago
qianlihuang qianlihuang force pushed 25 days ago
qianlihuang qianlihuang force pushed 25 days ago
mergify
mergify mergify added documentation
mergify mergify added ci/build
mergify mergify added deepseek
mergify mergify added frontend
mergify mergify added llama
mergify mergify added multi-modality
mergify mergify added new-model
mergify mergify added performance
mergify mergify added qwen
mergify mergify added gpt-oss
mergify mergify added nvidia
mergify mergify added rocm
mergify mergify added cpu
mergify mergify added structured-output
mergify mergify added speculative-decoding
mergify mergify added tpu
mergify mergify added tool-calling
mergify mergify added kv-connector
mergify
mergify mergify added needs-rebase
qianlihuang qianlihuang closed this 25 days ago
qianlihuang qianlihuang force pushed to 808dd87b 25 days ago
mergify mergify removed tpu
qianlihuang qianlihuang reopened this 25 days ago
mergify mergify removed needs-rebase
qianlihuang qianlihuang force pushed to ac443b33 25 days ago
qianlihuang qianlihuang marked this pull request as ready for review 25 days ago
qianlihuang qianlihuang marked this pull request as draft 25 days ago
qianlihuang qianlihuang marked this pull request as ready for review 25 days ago
qianlihuang
benchislett
benchislett commented on 2026-02-04
qianlihuang qianlihuang marked this pull request as draft 23 days ago
qianlihuang qianlihuang force pushed to 0cee7fa8 23 days ago
qianlihuang qianlihuang marked this pull request as ready for review 23 days ago
qianlihuang
benchislett
benchislett commented on 2026-02-17
benchislett
benchislett commented on 2026-02-17
mergify
mergify mergify added needs-rebase
mergify mergify removed needs-rebase
mergify
qianlihuang qianlihuang requested a review from NickLucche NickLucche 4 days ago
mergify
[Core] Support min_tokens with speculative decoding
f69f5526
qianlihuang qianlihuang force pushed from 468b2fac to f69f5526 4 days ago
benchislett
benchislett commented on 2026-02-25
benchislett
benchislett commented on 2026-02-25
add comment
c325db58
benchislett
benchislett approved these changes on 2026-02-25
benchislett benchislett added ready
benchislett benchislett enabled auto-merge (squash) 3 days ago
Update custom logitsproc spec-dec test expectation
1abd6dd8
disabled auto-merge 2 days ago
Head branch was pushed to by a user without write access
qianlihuang Merge branch 'main' into feature/spec-dec-min-tokens
6e4252c4
benchislett benchislett merged d9406076 into main 1 day ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone