vllm
Implicit language-model-only mode via limit-mm-per-prompt
#22299
Merged

Implicit language-model-only mode via limit-mm-per-prompt #22299

vllm-bot merged 73 commits into vllm-project:main from ywang96:main
ywang96
Test multimodal limit disables model
16e2d78d
update
d6665597
add models
7714cfac
add llava
a2f47208
Merge pull request #3 from ywang96/codex/add-text-only-mode-for-llama4
2f5da46a
add qwen2.5omni
f9d47096
ywang96 ywang96 changed the title Implict language-model-only mode via limit-mm-per-prompt Implicit language-model-only mode via limit-mm-per-prompt 284 days ago
mergify mergify added llama
mergify mergify added qwen
gemini-code-assist
gemini-code-assist commented on 2025-08-06
move test
3bc743dc
update
7478cdcb
update
9c66f5d5
github-actions
ywang96 ywang96 requested a review from Isotr0py Isotr0py 284 days ago
ywang96 ywang96 requested a review from DarkLight1337 DarkLight1337 284 days ago
ywang96
ywang96 ywang96 marked this pull request as ready for review 284 days ago
ywang96 ywang96 requested a review from patrickvonplaten patrickvonplaten 284 days ago
ywang96 ywang96 requested a review from sighingnow sighingnow 284 days ago
ywang96 ywang96 requested a review from simon-mo simon-mo 284 days ago
ywang96 ywang96 requested a review from WoosukKwon WoosukKwon 284 days ago
ywang96 ywang96 requested a review from youkaichao youkaichao 284 days ago
ywang96 ywang96 requested a review from robertgshaw2-redhat robertgshaw2-redhat 284 days ago
ywang96 ywang96 requested a review from mgoin mgoin 284 days ago
ywang96 ywang96 requested a review from tlrmchlsmth tlrmchlsmth 284 days ago
ywang96 ywang96 requested a review from houseroad houseroad 284 days ago
ywang96 ywang96 requested a review from hmellor hmellor 284 days ago
ywang96 ywang96 removed review request from tlrmchlsmth tlrmchlsmth 284 days ago
ywang96 ywang96 removed review request from mgoin mgoin 284 days ago
ywang96 ywang96 removed review request from sighingnow sighingnow 284 days ago
ywang96 ywang96 removed review request from hmellor hmellor 284 days ago
ywang96 ywang96 removed review request from simon-mo simon-mo 284 days ago
ywang96 ywang96 removed review request from youkaichao youkaichao 284 days ago
ywang96 ywang96 removed review request from patrickvonplaten patrickvonplaten 284 days ago
ywang96 ywang96 removed review request from houseroad houseroad 284 days ago
ywang96 ywang96 removed review request from WoosukKwon WoosukKwon 284 days ago
ywang96 ywang96 removed review request from robertgshaw2-redhat robertgshaw2-redhat 284 days ago
DarkLight1337
DarkLight1337 commented on 2025-08-06
sfeng33
sfeng33 commented on 2025-08-06
Isotr0py
Isotr0py approved these changes on 2025-08-06
ywang96 ywang96 marked this pull request as draft 284 days ago
ywang96
address comment
3048ba35
revert changes
31f45f53
refactor
dc204584
mergify mergify added multi-modality
mergify mergify added v1
update
cf71e91a
update message
f3a7a3e2
cache call
741c1b0c
ywang96
ywang96 commented on 2025-08-06
update
64433036
update
8881e6bc
mergify mergify added tpu
make mypy happy
e862c6be
add qwen2vl
73a1a6ed
update message
fc417006
fix mypy
69526d3f
hmellor
ywang96
sfeng33
sfeng33 commented on 2025-08-06
add unittest
68b2413d
update test
434e9b3c
add to pipeline
7594efb9
mergify mergify added ci/build
ywang96 ywang96 marked this pull request as ready for review 283 days ago
ywang96 ywang96 requested a review from njhill njhill 283 days ago
ywang96 ywang96 requested a review from comaniac comaniac 283 days ago
ywang96 ywang96 requested a review from alexm-redhat alexm-redhat 283 days ago
ywang96 ywang96 removed review request from njhill njhill 283 days ago
ywang96 ywang96 removed review request from comaniac comaniac 283 days ago
ywang96 ywang96 removed review request from alexm-redhat alexm-redhat 283 days ago
ywang96 ywang96 requested a review from Isotr0py Isotr0py 283 days ago
ywang96 ywang96 requested a review from DarkLight1337 DarkLight1337 283 days ago
mgoin
mgoin approved these changes on 2025-08-07
comment
6744ac18
DarkLight1337
DarkLight1337 commented on 2025-08-07
DarkLight1337
DarkLight1337 commented on 2025-08-07
DarkLight1337
DarkLight1337 commented on 2025-08-07
DarkLight1337
DarkLight1337 commented on 2025-08-07
DarkLight1337
DarkLight1337 commented on 2025-08-07
comments
e1748fa8
DarkLight1337
DarkLight1337 commented on 2025-08-07
update
e0f13838
delete
af746eec
Isotr0py
Isotr0py approved these changes on 2025-08-07
comment
aac7794e
update
1c6cef83
update test
5af9e582
simplify
15850be4
update
12966d49
mergify
mergify mergify added needs-rebase
Merge branch 'main' into main
acf0d5a8
mergify mergify removed needs-rebase
update
e941f333
update
e329b330
move
9b5a1767
ywang96 ywang96 added ready
update
0a6a655e
andyxning [Misc] normalize multiprocessing Queue usage (#22371)
006c0c8a
tjtanaa [ROCm] [V1] [SpecDec] Enable Speculative Decoding on ROCm V1 Engine (…
4f060b68
qthequartermasterman [PERF] Use pybase64 to more quickly decode prompt embeddings (#22469)
665698cd
Edwardf0t1 Add ModelOpt Qwen3 nvfp4 support (#20101)
a34e38fd
wenscarl Support Tensorrt-LLM MoE fp4 for low-latency (#21331)
8e049bef
wenscarl Fix Flashinfer CUTLASS MOE Allgather (#21963)
c7d8545c
0xjunhao [Kernel] Add support for block FP8 on SM120 (NVIDIA 5090 and RTX PRO …
4355cfb7
chaunceyjiang [Bugfix] Fix RuntimeError: Index put requires the source and destinat…
f20b0405
zRzRzRzRzRzRzR not tie_word_embeddings for glm-4.5 and glm-4.5v (#22460)
12b48536
skyloevil Optimize MiniCPMO mask creation with vectorized implementation (#22464)
85d4756e
DarkLight1337 Fix pre-commit (#22487)
64b3a57b
nvpohanh [bugfix] Fix Llama3/4 issues caused by FlashInfer 0.2.10 (#22426)
ebcb01cf
iAmir97 [Doc] Sleep mode documentation (#22310)
42f674c9
lk-chen [bench] Fix benchmark/serve.py to ignore unavailable results (#22382)
7723c454
DarkLight1337 [CI/Build] Fix multimodal tests (#22491)
cf8aab7a
ywang96 ywang96 requested a review from LucasWilkinson LucasWilkinson 282 days ago
ywang96 ywang96 requested a review from aarnphm aarnphm 282 days ago
ywang96 ywang96 requested a review from zhuohan123 zhuohan123 282 days ago
mergify mergify added documentation
mergify mergify added frontend
mergify mergify added performance
mergify mergify added speculative-decoding
ywang96 ywang96 removed review request from LucasWilkinson LucasWilkinson 282 days ago
ywang96 ywang96 removed review request from aarnphm aarnphm 282 days ago
ywang96 ywang96 removed review request from zhuohan123 zhuohan123 282 days ago
ywang96
Revert "[CI/Build] Fix multimodal tests (#22491)"
e05914e1
Revert "[bench] Fix benchmark/serve.py to ignore unavailable results …
244e08f6
Revert "[Doc] Sleep mode documentation (#22310)"
9e4da599
Revert "[bugfix] Fix Llama3/4 issues caused by FlashInfer 0.2.10 (#22…
df168cb8
Revert "Fix pre-commit (#22487)"
4b7992f4
Revert "Optimize MiniCPMO mask creation with vectorized implementatio…
83b7ff0e
Revert "not tie_word_embeddings for glm-4.5 and glm-4.5v (#22460)"
9c4d6e16
Revert "[Bugfix] Fix RuntimeError: Index put requires the source and …
ba90fb12
Revert "[Kernel] Add support for block FP8 on SM120 (NVIDIA 5090 and …
b613e220
Revert "Fix Flashinfer CUTLASS MOE Allgather (#21963)"
3d64d842
Revert "Support Tensorrt-LLM MoE fp4 for low-latency (#21331)"
0c2e1980
Revert "Add ModelOpt Qwen3 nvfp4 support (#20101)"
e8661ce4
Revert "[PERF] Use pybase64 to more quickly decode prompt embeddings …
2d6e3584
Revert "[ROCm] [V1] [SpecDec] Enable Speculative Decoding on ROCm V1 …
03807c7c
Revert "[Misc] normalize multiprocessing Queue usage (#22371)"
dda8e001
Merge branch 'vllm-project:main' into main
1cfe05a5
ywang96 ywang96 enabled auto-merge (squash) 282 days ago
DarkLight1337
DarkLight1337 approved these changes on 2025-08-08
DarkLight1337 DarkLight1337 removed documentation
DarkLight1337 DarkLight1337 removed performance
DarkLight1337 DarkLight1337 removed speculative-decoding
DarkLight1337 DarkLight1337 removed ci/build
DarkLight1337 DarkLight1337 removed frontend
DarkLight1337 DarkLight1337 added ci/build
NickLucche
NickLucche requested changes on 2025-08-08
DarkLight1337
NickLucche
ywang96
undo
fc98f6f8
Merge branch 'vllm-project:main' into main
ac0fa665
delete
33d1f485
update
19ba00a2
ywang96
disabled auto-merge 281 days ago
Manually disabled by user
ywang96 ywang96 enabled auto-merge (squash) 281 days ago
disabled auto-merge 281 days ago
Manually disabled by user
ywang96 ywang96 enabled auto-merge (squash) 281 days ago
vllm-bot vllm-bot merged 08b751ba into main 281 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone