vllm
[Model] Refactoring of MiniCPM-V and add MiniCPM-o-2.6 support for vLLM
#12069
Merged

DarkLight1337 merged 118 commits into vllm-project:main from minicpmv-refactor
HwwwwwwwH
github-actions
DarkLight1337 assigned DarkLight1337 1 year ago
ywang96
HwwwwwwwH
HwwwwwwwH
DarkLight1337
HwwwwwwwH refactor for images
f78ad12a
HwwwwwwwH support image embedding for minicpmv
95230b95
llsj14 [Bugfix][SpecDecode] Adjust Eagle model architecture to align with in…
42ffb1b2
shaochangxu [Bugfix] fused_experts_impl wrong compute type for float32 (#11921)
43ff2e9e
DarkLight1337 [CI/Build] Move model-specific multi-modal processing tests (#11934)
0ec99742
DarkLight1337 [Doc] Basic guide for writing unit tests for new models (#11951)
b4a90946
NickLucche [Bugfix] Fix RobertaModel loading (#11940)
ac291981
sixsixcoder [Model] Add cogagent model support vLLM (#11742)
286107f3
ywang96 [V1] Avoid sending text prompt to core engine (#11963)
535e120b
rafvasq [CI/Build] Add markdown linter (#11857)
925562bb
Isotr0py [Model] Initialize support for Deepseek-VL2 models (#11578)
936b3067
Akshat-Tripathi [Hardware][CPU] Multi-LoRA implementation for the CPU backend (#11100)
141151f2
avshalomman [Hardware][TPU] workaround fix for MoE on TPU (#11764)
eac78115
robertgshaw2-redhat [V1][Core][1/n] Logging and Metrics (#11962)
e2518667
Isotr0py [Model] Support GGUF models newly added in `transformers` 4.46.0 (#9685)
e46c06bf
robertgshaw2-redhat [V1] [2/n] Logging and Metrics - `OutputProcessor` Abstraction (#11973)
d12c0de4
yyccli [MISC] fix typo in kv transfer send recv test (#11983)
e459c90f
liaoyanqing666 [Bug] Fix usage of `.transpose()` and `.view()` consecutively. (#11979)
93a78bae
llsj14 [CI][Spec Decode] fix: broken test for EAGLE model (#11972)
dd2f6271
Concurrensee [Misc] Fix Deepseek V2 fp8 kv-scale remapping (#11947)
570e067c
noemotiovon [Misc]Minor Changes about Worker (#11555)
eaccb74b
youkaichao [platform] add ray_device_key (#11948)
7adb4a03
alex-jw-brooks Fix Max Token ID for Qwen-VL-Chat (#11980)
a014ddd3
heheda12345 [Kernel] unified_attention for Attention.forward (#11967)
cedf6cc5
ywang96 [Doc][V1] Update model implementation guide for V1 support (#11998)
a1f053f3
hmellor [Doc] Organise installation documentation into categories and tabs (#…
651ee498
youkaichao [platform] add device_control env var (#12009)
adc0b547
shen-shanshan [Platform] Move get_punica_wrapper() function to Platform (#11516)
1fa0b25e
e1ijah1 bugfix: Fix signature mismatch in benchmark's `get_tokenizer` functio…
e55869e2
Yikun [Doc] Fix build from source and installation link in README.md (#12013)
7f2aa685
SunflowerAries [Bugfix] Fix deepseekv3 gate bias error (#12002)
a1f08148
WoosukKwon [Docs] Add Sky Computing Lab to project intro (#12019)
0ca468e2
kzawora-intel [HPU][Bugfix] set_forward_context and CI test execution (#12014)
6bec0d04
tjtanaa [Doc] Update Quantization Hardware Support Documentation (#12025)
0badf142
youkaichao [HPU][misc] add comments for explanation (#12034)
c6a5060b
DarkLight1337 [Bugfix] Fix various bugs in multi-modal processor (#12031)
055a2b7c
heheda12345 [Kernel] Revert the API change of Attention.forward (#12038)
941a5d5c
wangxiyuan [Platform] Add output for Attention Backend (#11981)
3a05c492
heheda12345 [Bugfix][Kernel] Give unique name to BlockSparseFlashAttention (#12040)
87a687bc
hmellor Explain where the engine args go when using Docker (#12041)
3183e6ac
maang-h [Doc]: Update the Json Example of the `Engine Arguments` document (#1…
cc9cde52
jeejeelee [Misc] Merge bitsandbytes_stacked_params_mapping and packed_modules_…
58d45cd2
jeejeelee [Kernel] Support MulAndSilu (#11624)
bb13b8a6
kzawora-intel [HPU][Bugfix] Don't use /dev/accel/accel0 for HPU autodetection in se…
1bba3f63
shen-shanshan [Platform] move current_memory_usage() into platform (#11369)
ef22c6cf
WoosukKwon [V1][BugFix] Fix edge case in VLM scheduling (#12065)
94adbff1
elfiegg [Misc] Add multistep chunked-prefill support for FlashInfer (#10467)
654f5d7f
ruisearch42 [core] Turn off GPU communication overlap for Ray executor (#12051)
8146c68b
youkaichao [core] platform agnostic executor via collective_rpc (#11256)
59e5cf4c
HwwwwwwwH merge main
920038b3
HwwwwwwwH video embedding supports
6f6d2eb0
HwwwwwwwH update support for minicpmo on images and videos
364bca1b
HwwwwwwwH audio language
c2d8dbb8
HwwwwwwwH audio embedding inputs
1ba77eb4
HwwwwwwwH format
1c6f7d84
HwwwwwwwH force-pushed to 1c6f7d84 1 year ago
mergify added documentation
mergify added ci/build
mergify added frontend
mergify added needs-rebase
HwwwwwwwH merge main x
26d40a57
HwwwwwwwH merge main
24d9a809
mergify removed needs-rebase
jeejeelee Merge branch 'main' of https://github.com/vllm-project/vllm into mini…
29774db0
HwwwwwwwH docs/server-chat-utils/tests for minicpmo
6c409c5e
HwwwwwwwH Merge branch 'minicpmv-refactor' of github.com:HwwwwwwwH/vllm into mi…
ee2f7da8
HwwwwwwwH marked this pull request as ready for review 1 year ago
HwwwwwwwH requested a review from DarkLight1337 1 year ago
HwwwwwwwH requested a review from ywang96 1 year ago
DarkLight1337 commented on 2025-01-23
HwwwwwwwH Update docs/source/models/supported_models.md
42e7e782
HwwwwwwwH Update tests/models/decoder_only/vision_language/test_models.py
6c0a6863
HwwwwwwwH format
c15228b8
HwwwwwwwH Merge branch 'minicpmv-refactor' of github.com:HwwwwwwwH/vllm into mi…
c51026de
DarkLight1337 commented on 2025-01-23
DarkLight1337 commented on 2025-01-23
DarkLight1337 commented on 2025-01-23
DarkLight1337 commented on 2025-01-23
HwwwwwwwH split minicpmo in a separate file
ac26f599
HwwwwwwwH format
8b0cbf7c
DarkLight1337 commented on 2025-01-23
DarkLight1337 commented on 2025-01-23
DarkLight1337 commented on 2025-01-23
HwwwwwwwH Update vllm/model_executor/models/minicpmo.py
428ae5aa
HwwwwwwwH add hints
edfac98d
HwwwwwwwH format
4ed8b116
DarkLight1337 commented on 2025-01-24
DarkLight1337 commented on 2025-01-24
HwwwwwwwH clean unnecessary logic of WhisperEncoder
b44085ee
HwwwwwwwH format
763c5784
DarkLight1337
DarkLight1337 commented on 2025-01-24
PancakeAwesome
HwwwwwwwH
HwwwwwwwH Update vllm/model_executor/models/minicpmo.py
cd684848
HwwwwwwwH add torchaudio for test
1e47208d
HwwwwwwwH add annotations
781d1c38
HwwwwwwwH format
f0b0270c
zhy844694805
zhy844694805
HwwwwwwwH
ywang96
ywang96 assigned ywang96 1 year ago
ywang96 Merge remote-tracking branch 'upstream/main' into minicpmv-refactor
ed1dd9eb
HwwwwwwwH enable MiniCPMV-MiniCPMO for cache
6d5978a3
HwwwwwwwH Merge branch 'minicpmv-refactor' of github.com:HwwwwwwwH/vllm into mi…
3bb67f88
HwwwwwwwH add multimodal tests for minicpmv
25d86ce7
HwwwwwwwH format
bec9a733
HwwwwwwwH custom_hf_runner for minicpmo
2120dd6e
HwwwwwwwH Merge branch 'minicpmv-refactor' of github.com:HwwwwwwwH/vllm into mi…
0fd43479
HwwwwwwwH format
6d2f4e41
HwwwwwwwH pass all tests
fac61ebf
HwwwwwwwH format / pass all tests
6037606d
HwwwwwwwH
DarkLight1337 commented on 2025-01-27
HwwwwwwwH fix num_slices bug
b6f24f70
HwwwwwwwH Merge branch 'minicpmv-refactor' of github.com:HwwwwwwwH/vllm into mi…
e439d3a7
Isotr0py commented on 2025-01-27
HwwwwwwwH add examples
0f67ac91
HwwwwwwwH add examples and format tests
eab479fe
HwwwwwwwH format
05a0ef81
HwwwwwwwH Update tests/models/decoder_only/vision_language/vlm_utils/model_util…
6650450b
HwwwwwwwH Update vllm/model_executor/models/minicpmv.py
8f5b0690
HwwwwwwwH Update vllm/model_executor/models/minicpmv.py
de0b55f6
HwwwwwwwH Update vllm/model_executor/models/minicpmv.py
ad528591
HwwwwwwwH Update vllm/model_executor/models/minicpmv.py
c5b912d8
HwwwwwwwH Update vllm/model_executor/models/minicpmo.py
00e9e5a4
HwwwwwwwH alphabet
49ea11e3
HwwwwwwwH add annotations
595c6791
HwwwwwwwH Merge branch 'minicpmv-refactor' of github.com:HwwwwwwwH/vllm into mi…
061596f8
HwwwwwwwH add torchaudio dependency
26d4b2bd
HwwwwwwwH format
5867171d
HwwwwwwwH torchaudio
bed7843f
HwwwwwwwH fix minicpmo_patch_hf_runner
715bd9fd
HwwwwwwwH fix slice bug
cf4788fb
HwwwwwwwH Merge branch 'main' into minicpmv-refactor
53c679e7
HwwwwwwwH format
3127a6b6
ywang96 added ready
HwwwwwwwH test model register
290795b2
HwwwwwwwH delete minicpmv2.5 in test_common
d9dedd70
HwwwwwwwH add dependencies of minicpmo audio tests
f6d5cfa9
HwwwwwwwH format
da2ddd3b
HwwwwwwwH add vocos in requirements_test.in
4222899b
HwwwwwwwH Merge branch 'minicpmv-refactor' of github.com:HwwwwwwwH/vllm into mi…
26ebc7cf
HwwwwwwwH alphabet in example file and server
2e93896c
HwwwwwwwH Merge branch 'main' into minicpmv-refactor
0dfa513c
DarkLight1337 approved these changes on 2025-01-28
mergify added needs-rebase
DarkLight1337 Merge branch 'main' into minicpmv-refactor
dadd0300
mergify removed needs-rebase
ywang96 approved these changes on 2025-01-29
mergify added needs-rebase
HwwwwwwwH merge main && fix conflict
f5a188a3
HwwwwwwwH delete vocos in setup.py
8216fd5f
mergify removed needs-rebase
DarkLight1337 commented on 2025-01-29
HwwwwwwwH update docs
4cfd785c
DarkLight1337 enabled auto-merge (squash) 1 year ago
DarkLight1337 merged d93bf4da into main 1 year ago
HwwwwwwwH
jasstionzyf
WangVertex
WangVertex
Jiltseb
HwwwwwwwH
Jiltseb
WangVertex
adamfluty
