vllm
[Core] Support `min_tokens` with speculative decoding
#32642
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
4
Changes
View On
GitHub
[Core] Support `min_tokens` with speculative decoding
#32642
benchislett
merged 4 commits into
vllm-project:main
from
qianlihuang:feature/spec-dec-min-tokens
mergify
added
v1
gemini-code-assist
commented on 2026-01-20
qianlihuang
changed the title
Feat: add sampling (min_tokens,...) support for speculative decoding
[Core] Support `min_tokens` with speculative decoding
38 days ago
qianlihuang
marked this pull request as ready for review
38 days ago
qianlihuang
requested a review
from
22quinn
38 days ago
qianlihuang
requested a review
from
houseroad
38 days ago
qianlihuang
requested a review
from
njhill
38 days ago
qianlihuang
requested a review
from
copilot-pull-request-reviewer
38 days ago
copilot-pull-request-reviewer
commented on 2026-01-21
qianlihuang
force pushed
38 days ago
qianlihuang
force pushed
38 days ago
qianlihuang
force pushed
38 days ago
qianlihuang
force pushed
36 days ago
qianlihuang
force pushed
36 days ago
benchislett
commented on 2026-02-01
qianlihuang
marked this pull request as draft
26 days ago
qianlihuang
force pushed
25 days ago
qianlihuang
force pushed
25 days ago
qianlihuang
force pushed
25 days ago
qianlihuang
force pushed
25 days ago
mergify
added
documentation
mergify
added
ci/build
mergify
added
deepseek
mergify
added
frontend
mergify
added
llama
mergify
added
multi-modality
mergify
added
new-model
mergify
added
performance
mergify
added
qwen
mergify
added
gpt-oss
mergify
added
nvidia
mergify
added
rocm
mergify
added
cpu
mergify
added
structured-output
mergify
added
speculative-decoding
mergify
added
tpu
mergify
added
tool-calling
mergify
added
kv-connector
mergify
added
needs-rebase
qianlihuang
closed this
25 days ago
qianlihuang
force pushed
to
808dd87b
25 days ago
mergify
removed
tpu
qianlihuang
reopened this
25 days ago
mergify
removed
needs-rebase
qianlihuang
force pushed
to
ac443b33
25 days ago
qianlihuang
marked this pull request as ready for review
25 days ago
qianlihuang
marked this pull request as draft
25 days ago
qianlihuang
marked this pull request as ready for review
25 days ago
benchislett
commented on 2026-02-04
qianlihuang
marked this pull request as draft
23 days ago
qianlihuang
force pushed
to
0cee7fa8
23 days ago
qianlihuang
marked this pull request as ready for review
23 days ago
benchislett
commented on 2026-02-17
benchislett
commented on 2026-02-17
mergify
added
needs-rebase
mergify
removed
needs-rebase
qianlihuang
requested a review
from
NickLucche
4 days ago
[Core] Support min_tokens with speculative decoding
f69f5526
qianlihuang
force pushed
from
468b2fac
to
f69f5526
4 days ago
benchislett
commented on 2026-02-25
benchislett
commented on 2026-02-25
add comment
c325db58
benchislett
approved these changes on 2026-02-25
benchislett
added
ready
benchislett
enabled auto-merge (squash)
3 days ago
Update custom logitsproc spec-dec test expectation
1abd6dd8
disabled auto-merge
2 days ago
Head branch was pushed to by a user without write access
Merge branch 'main' into feature/spec-dec-min-tokens
6e4252c4
benchislett
merged
d9406076
into main
1 day ago
Login to write a write a comment.
Login via GitHub
Reviewers
benchislett
copilot-pull-request-reviewer
gemini-code-assist
22quinn
houseroad
njhill
NickLucche
Assignees
No one assigned
Labels
documentation
performance
new-model
rocm
structured-output
frontend
speculative-decoding
ready
ci/build
v1
multi-modality
tool-calling
llama
qwen
deepseek
cpu
gpt-oss
kv-connector
nvidia
Milestone
No milestone
Login to write a write a comment.
Login via GitHub