vllm
[Frontend] Add chunked processing to handle long inputs in embedding models
#22280
Merged

[Frontend] Add chunked processing to handle long inputs in embedding models #22280

x22x22
x22x22 The latest update introduces new text embedding examples and service …
9ebc61b0
x22x22 x22x22 requested a review from simon-mo simon-mo 160 days ago
x22x22 x22x22 requested a review from WoosukKwon WoosukKwon 160 days ago
x22x22 x22x22 requested a review from youkaichao youkaichao 160 days ago
x22x22 x22x22 requested a review from robertgshaw2-redhat robertgshaw2-redhat 160 days ago
x22x22 x22x22 requested a review from mgoin mgoin 160 days ago
x22x22 x22x22 requested a review from tlrmchlsmth tlrmchlsmth 160 days ago
x22x22 x22x22 requested a review from houseroad houseroad 160 days ago
x22x22 x22x22 requested a review from hmellor hmellor 160 days ago
x22x22 x22x22 requested a review from aarnphm aarnphm 160 days ago
github-actions
mergify mergify added documentation
mergify mergify added frontend
gemini-code-assist
gemini-code-assist commented on 2025-08-05
x22x22 修复合并多模态处理器参数的逻辑,确保正确合并传入的参数。更新了相关文件以使用新的合并方式。
cab8200b
x22x22 restore
57987aa8
x22x22 Feature: Implement chunk processing and maximum embedding length conf…
8e3ba726
x22x22 restore
f24b5468
x22x22 restore
b46791be
x22x22 Feature: Implementation of Chunk Processing for Embedding Requests of…
54c79301
DarkLight1337
DarkLight1337 commented on 2025-08-06
DarkLight1337
DarkLight1337 commented on 2025-08-06
DarkLight1337
DarkLight1337 commented on 2025-08-06
DarkLight1337
DarkLight1337 commented on 2025-08-06
DarkLight1337
DarkLight1337 commented on 2025-08-06
DarkLight1337
DarkLight1337 commented on 2025-08-06
x22x22 revert: restore processor.py and registry.py to main branch state
1ad1ae3b
x22x22 Refactor: Enhance the code structure and error handling logic for emb…
35e0aeed
x22x22 Refactor: Enhance the code structure and error handling logic for emb…
483be3eb
x22x22 Refactor: Enhance the code structure and error handling logic for emb…
d410c342
x22x22 Feature: Implementation of an automatic chunking mechanism for long t…
8880316f
x22x22 Feature: Implementation of an automatic chunking mechanism for long t…
6e624216
maxdebayser
maxdebayser commented on 2025-08-06
maxdebayser
maxdebayser commented on 2025-08-06
maxdebayser
maxdebayser commented on 2025-08-06
maxdebayser
maxdebayser commented on 2025-08-06
maxdebayser
maxdebayser commented on 2025-08-06
maxdebayser
maxdebayser commented on 2025-08-06
x22x22 Refactoring inelegant code
ae380ed6
x22x22 x22x22 requested a review from maxdebayser maxdebayser 159 days ago
x22x22 x22x22 requested a review from DarkLight1337 DarkLight1337 159 days ago
noooop
noooop commented on 2025-08-07
x22x22 Refactoring inelegant code
3ce8d47e
x22x22 Refactoring inelegant code
54ad46e3
x22x22 Refactoring inelegant code
503ab003
x22x22
x22x22 x22x22 requested a review from noooop noooop 159 days ago
hmellor
hmellor commented on 2025-08-08
mergify
mergify mergify added needs-rebase
x22x22 Merge branch 'main' into feat/support-long-text-embedding
a48c7c4f
mergify mergify removed needs-rebase
x22x22 Refactoring inelegant code
8949c8f3
x22x22 x22x22 requested a review from hmellor hmellor 155 days ago
x22x22 Refactoring inelegant code
d42419e4
x22x22 Refactoring inelegant code
ac5b69a0
x22x22
hmellor
x22x22
DarkLight1337
x22x22
DarkLight1337
DarkLight1337 commented on 2025-08-11
DarkLight1337
DarkLight1337 commented on 2025-08-11
DarkLight1337
DarkLight1337 commented on 2025-08-11
DarkLight1337
DarkLight1337 commented on 2025-08-11
DarkLight1337
DarkLight1337 commented on 2025-08-11
DarkLight1337
DarkLight1337 commented on 2025-08-11
DarkLight1337
DarkLight1337 commented on 2025-08-11
DarkLight1337
DarkLight1337 commented on 2025-08-11
x22x22 Refactoring inelegant code
b8fe2660
x22x22 Refactoring inelegant code
e9a5d70f
x22x22 Refactoring inelegant code
d0c1c9ee
x22x22 x22x22 requested a review from DarkLight1337 DarkLight1337 154 days ago
DarkLight1337
DarkLight1337 commented on 2025-08-11
DarkLight1337
DarkLight1337 commented on 2025-08-11
x22x22 Update vllm/entrypoints/openai/serving_embedding.py
4de2c2b3
x22x22 Refactoring inelegant code
dc067f37
x22x22
maxdebayser
maxdebayser commented on 2025-08-13
maxdebayser
maxdebayser commented on 2025-08-13
maxdebayser
maxdebayser commented on 2025-08-13
maxdebayser
maxdebayser commented on 2025-08-13
maxdebayser
maxdebayser commented on 2025-08-13
maxdebayser
maxdebayser commented on 2025-08-13
maxdebayser
maxdebayser commented on 2025-08-13
x22x22 Update vllm/entrypoints/openai/serving_embedding.py
cf19859a
x22x22 x22x22 requested a review from yewentao256 yewentao256 153 days ago
x22x22 x22x22 requested a review from ProExpertProg ProExpertProg 153 days ago
x22x22 Update vllm/entrypoints/openai/serving_embedding.py
8fab6039
x22x22 Update vllm/entrypoints/openai/serving_embedding.py
8c7d56b2
x22x22 Refactoring inelegant code
fa3b69f7
x22x22 Refactoring inelegant code
6584107a
maxdebayser
maxdebayser commented on 2025-08-13
maxdebayser
maxdebayser commented on 2025-08-13
maxdebayser
maxdebayser commented on 2025-08-13
maxdebayser
maxdebayser commented on 2025-08-13
maxdebayser
maxdebayser commented on 2025-08-13
x22x22 Refactoring inelegant code
f4d48ce7
maxdebayser
maxdebayser commented on 2025-08-13
x22x22 Refactoring inelegant code
94a75767
x22x22 Refactoring inelegant code
34441413
x22x22 Refactoring inelegant code
ac02136d
x22x22 Refactoring inelegant code
17c43170
x22x22 Refactoring inelegant code
8866b5d4
x22x22
DarkLight1337 Reduce diff
b5230ed8
DarkLight1337 Simplify
b362cbd0
DarkLight1337
DarkLight1337 approved these changes on 2025-08-13
DarkLight1337
DarkLight1337 Merge branch 'main' into feat/support-long-text-embedding
15c462bd
DarkLight1337 DarkLight1337 added ready
x22x22
x22x22 Refactoring inelegant code
d515efdb
DarkLight1337 DarkLight1337 enabled auto-merge (squash) 152 days ago
vllm-bot vllm-bot merged 653124bd into main 152 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone