vllm
[Model][Speculative Decoding] Expand DeepSeek MTP code to support k > n_predict
#13626
Merged

[Model][Speculative Decoding] Expand DeepSeek MTP code to support k > n_predict #13626

benchislett
benchislett expand mtp code to support k>n_predict
3692f83b
github-actions
mergify mergify added speculative-decoding
simon-mo simon-mo requested a review from LiuXiaoxuanPKU LiuXiaoxuanPKU 292 days ago
benchislett Merge branch 'main' into deepseek-multi-mtp
570da361
luyuzhe111
luyuzhe111 commented on 2025-02-24
luyuzhe111
luyuzhe111 commented on 2025-02-24
benchislett Enable returning hidden states in draft model runner
637dc639
benchislett Merge branch 'main' into deepseek-multi-mtp
f3296941
benchislett tiny refactor
0f7d6790
benchislett formatting fixes
0eb2fad0
benchislett simplify return hidden states in TP1DraftModelRunner
c308daab
mergify
mergify mergify added needs-rebase
luccafong
luccafong commented on 2025-02-25
benchislett Merge branch 'main' into deepseek-multi-mtp
627a8caf
mergify mergify removed needs-rebase
LiuXiaoxuanPKU
luccafong
benchislett
LiuXiaoxuanPKU
LiuXiaoxuanPKU approved these changes on 2025-02-25
luyuzhe111
benchislett clarifying comment in config.py
8143c908
benchislett make sure draft multi-step with mla uses fallback path
b5772f44
benchislett Merge branch 'main' into deepseek-multi-mtp
997ee093
LiuXiaoxuanPKU LiuXiaoxuanPKU added ready
LiuXiaoxuanPKU LiuXiaoxuanPKU merged 9804145c into main 285 days ago
Neo9061
benchislett
TianTengya
benchislett
TianTengya

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone