vllm
[Model][Speculative Decoding] Expand DeepSeek MTP code to support k > n_predict
#13626
Merged
LiuXiaoxuanPKU merged 11 commits into vllm-project:main from CentML:deepseek-multi-mtp
expand mtp code to support k>n_predict
3692f83b
mergify added speculative-decoding
simon-mo requested a review from LiuXiaoxuanPKU 292 days ago
Merge branch 'main' into deepseek-multi-mtp
570da361
luyuzhe111 commented on 2025-02-24
luyuzhe111 commented on 2025-02-24
Enable returning hidden states in draft model runner
637dc639
Merge branch 'main' into deepseek-multi-mtp
f3296941
tiny refactor
0f7d6790
formatting fixes
0eb2fad0
simplify return hidden states in TP1DraftModelRunner
c308daab
mergify added needs-rebase
luccafong commented on 2025-02-25
Merge branch 'main' into deepseek-multi-mtp
627a8caf
mergify removed needs-rebase
LiuXiaoxuanPKU approved these changes on 2025-02-25
clarifying comment in config.py
8143c908
make sure draft multi-step with mla uses fallback path
b5772f44
Merge branch 'main' into deepseek-multi-mtp
997ee093
LiuXiaoxuanPKU added ready
LiuXiaoxuanPKU merged 9804145c into main 285 days ago
Reviewers
LiuXiaoxuanPKU
luccafong
pyc96
luyuzhe111
Assignees
No one assigned
Labels
speculative-decoding
ready
Milestone
No milestone