vllm
[Core] Support `min_tokens` with speculative decoding
#32642
Merged

qianlihuang
mergify added v1
gemini-code-assist commented on 2026-01-20
qianlihuang changed the title from "Feat: add sampling (min_tokens,...) support for speculative decoding" to "[Core] Support `min_tokens` with speculative decoding" 109 days ago
qianlihuang marked this pull request as ready for review 109 days ago
qianlihuang requested a review from 22quinn 109 days ago
qianlihuang requested a review from houseroad 109 days ago
qianlihuang requested a review from njhill 109 days ago
qianlihuang requested a review from copilot-pull-request-reviewer 109 days ago
mergify
copilot-pull-request-reviewer commented on 2026-01-21
mergify
qianlihuang force pushed 109 days ago
qianlihuang force pushed 109 days ago
qianlihuang force pushed 109 days ago
qianlihuang force pushed 107 days ago
qianlihuang force pushed 107 days ago
qianlihuang
benchislett commented on 2026-02-01
njhill
qianlihuang marked this pull request as draft 97 days ago
qianlihuang force pushed 96 days ago
qianlihuang force pushed 96 days ago
qianlihuang force pushed 96 days ago
qianlihuang force pushed 96 days ago
mergify
mergify added documentation, ci/build, deepseek, frontend, llama, multi-modality, new-model, performance, qwen, gpt-oss, nvidia, rocm, cpu, structured-output, speculative-decoding, tpu, tool-calling, kv-connector
mergify
mergify added needs-rebase
qianlihuang closed this 96 days ago
qianlihuang force pushed to 808dd87b 96 days ago
mergify removed tpu
qianlihuang reopened this 96 days ago
mergify removed needs-rebase
qianlihuang force pushed 96 days ago
qianlihuang marked this pull request as ready for review 96 days ago
qianlihuang marked this pull request as draft 96 days ago
qianlihuang marked this pull request as ready for review 96 days ago
qianlihuang
benchislett commented on 2026-02-04
qianlihuang marked this pull request as draft 94 days ago
qianlihuang force pushed 94 days ago
qianlihuang marked this pull request as ready for review 94 days ago
qianlihuang
benchislett commented on 2026-02-17
benchislett commented on 2026-02-17
mergify
mergify added needs-rebase
mergify removed needs-rebase
mergify
qianlihuang requested a review from NickLucche 75 days ago
mergify
[Core] Support min_tokens with speculative decoding
f69f5526
qianlihuang force pushed to f69f5526 75 days ago
benchislett commented on 2026-02-25
benchislett commented on 2026-02-25
add comment
c325db58
benchislett approved these changes on 2026-02-25
benchislett added ready
benchislett enabled auto-merge (squash) 74 days ago
Update custom logitsproc spec-dec test expectation
1abd6dd8
disabled auto-merge 73 days ago
Head branch was pushed to by a user without write access
qianlihuang Merge branch 'main' into feature/spec-dec-min-tokens
6e4252c4
benchislett merged d9406076 into main 73 days ago

Assignees: No one assigned