vllm
[Spec decode] automatically disable mm for text-only draft models
#25667
Merged

jmkuebler
jmkuebler automatically disable mm for text-only draft models
180a53eb
jmkuebler requested a review from benchislett 78 days ago
jmkuebler requested a review from luccafong 78 days ago
mergify added speculative-decoding
mergify added v1
gemini-code-assist
gemini-code-assist commented on 2025-09-25
jmkuebler make more robust
4be35841
DarkLight1337
DarkLight1337 commented on 2025-09-25
jmkuebler add temporary test fix
f46ab4b9
jmkuebler make diff smaller
8fcc3cd4
DarkLight1337
DarkLight1337 commented on 2025-09-26
jmkuebler
jmkuebler commented on 2025-09-26
DarkLight1337
DarkLight1337 commented on 2025-09-26
jmkuebler Merge branch 'main' into enable_text_only_for_MM
5e99cd24
jmkuebler update FA constant and enforce eager (temporarily)
a4108a4b
jmkuebler
jmkuebler commented on 2025-09-26
jmkuebler
jmkuebler remove eager flag. After rebasing it works also w/ compilation
45d47146
jmkuebler remove V1 flags
710c27d8
jmkuebler requested a review from DarkLight1337 77 days ago
DarkLight1337
jmkuebler
DarkLight1337
jmkuebler use large_gpu_mark
cd13e81b
jmkuebler
jmkuebler commented on 2025-09-26
jmkuebler
jmkuebler commented on 2025-09-26
jmkuebler
DarkLight1337
DarkLight1337 approved these changes on 2025-09-26
DarkLight1337
DarkLight1337 added ready
DarkLight1337
DarkLight1337 commented on 2025-09-26
jmkuebler Merge branch 'main' into enable_text_only_for_MM
de4336b0
DarkLight1337 merged 6f5c0931 into main 77 days ago
