vllm
[Spec decode] automatically disable mm for text-only draft models
#25667
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
10
Changes
View On
GitHub
[Spec decode] automatically disable mm for text-only draft models
#25667
DarkLight1337
merged 10 commits into
vllm-project:main
from
jmkuebler:enable_text_only_for_MM
automatically disable mm for text-only draft models
180a53eb
jmkuebler
requested a review
from
benchislett
78 days ago
jmkuebler
requested a review
from
luccafong
78 days ago
mergify
added
speculative-decoding
mergify
added
v1
gemini-code-assist
commented on 2025-09-25
make more robust
4be35841
DarkLight1337
commented on 2025-09-25
add temporary test fix
f46ab4b9
make diff smaller
8fcc3cd4
DarkLight1337
commented on 2025-09-26
jmkuebler
commented on 2025-09-26
DarkLight1337
commented on 2025-09-26
Merge branch 'main' into enable_text_only_for_MM
5e99cd24
update FA constant and enforce eager (temporarily)
a4108a4b
jmkuebler
commented on 2025-09-26
remove eager flag. After rebasing it works also w/ compliation
45d47146
remove V1 flags
710c27d8
jmkuebler
requested a review
from
DarkLight1337
77 days ago
use large_gpu_mark
cd13e81b
jmkuebler
commented on 2025-09-26
jmkuebler
commented on 2025-09-26
DarkLight1337
approved these changes on 2025-09-26
DarkLight1337
added
ready
DarkLight1337
commented on 2025-09-26
Merge branch 'main' into enable_text_only_for_MM
de4336b0
DarkLight1337
merged
6f5c0931
into main
77 days ago
Login to write a write a comment.
Login via GitHub
Reviewers
DarkLight1337
gemini-code-assist
benchislett
luccafong
Assignees
No one assigned
Labels
speculative-decoding
ready
v1
Milestone
No milestone
Login to write a write a comment.
Login via GitHub