vllm
Add `/v1/chat/completions/batch` endpoint for batched chat completions
#38011
Merged

MatejRojec
MatejRojec requested a review from DarkLight1337 9 days ago
MatejRojec requested a review from robertgshaw2-redhat 9 days ago
MatejRojec requested a review from aarnphm 9 days ago
MatejRojec requested a review from NickLucche 9 days ago
MatejRojec requested a review from njhill 9 days ago
MatejRojec requested a review from chaunceyjiang 9 days ago
MatejRojec requested a review from russellb 9 days ago
MatejRojec closed this 9 days ago
mergify
mergify added documentation
mergify added frontend
gemini-code-assist
gemini-code-assist commented on 2026-03-24
github-actions
MatejRojec reopened this 9 days ago
mergify
mergify
MatejRojec marked this pull request as draft 9 days ago
MatejRojec force pushed from e30acdb6 to 234ec822 9 days ago
MatejRojec force pushed from 234ec822 to 7e986412 9 days ago
MatejRojec marked this pull request as ready for review 9 days ago
MatejRojec marked this pull request as draft 9 days ago
MatejRojec marked this pull request as ready for review 9 days ago
MatejRojec changed the title from "Add batch requests to vllm api" to "Add batched messages support to /v1/chat/completions" 9 days ago
DarkLight1337
MatejRojec marked this pull request as draft 8 days ago
MatejRojec marked this pull request as ready for review 8 days ago
MatejRojec marked this pull request as draft 8 days ago
MatejRojec changed the title from "Add batched messages support to /v1/chat/completions" to "Add `/v1/chat/completions/batch` endpoint for batched chat completions" 8 days ago
MatejRojec marked this pull request as ready for review 8 days ago
MatejRojec
DarkLight1337
DarkLight1337 commented on 2026-03-25
DarkLight1337
DarkLight1337 commented on 2026-03-25
MatejRojec requested a review from DarkLight1337 8 days ago
DarkLight1337
DarkLight1337 commented on 2026-03-25
MatejRojec requested a review from DarkLight1337 8 days ago
DarkLight1337
DarkLight1337 commented on 2026-03-25
MatejRojec requested a review from DarkLight1337 8 days ago
DarkLight1337
DarkLight1337 commented on 2026-03-25
MatejRojec requested a review from DarkLight1337 8 days ago
DarkLight1337
DarkLight1337 approved these changes on 2026-03-25
DarkLight1337 enabled auto-merge (squash) 8 days ago
DarkLight1337 added ready
mergify
disabled auto-merge 8 days ago
Head branch was pushed to by a user without write access
DarkLight1337
DarkLight1337 commented on 2026-03-25
MatejRojec Add batch requests to vllm api
7d93bde1
MatejRojec Fix n>1 validation and echo condition for batched chat completions
47e124ef
MatejRojec Fix pre-commit warnings
b8a91ffc
MatejRojec Update system prompt in .sh file
eb2e5313
MatejRojec Add edge case to manual tests
2c76e2af
MatejRojec Improve test by making them more robust
6e268e9f
MatejRojec Improve documentation and add ordering so results are always ordered …
724baee7
MatejRojec Fix pre commit checks with ruff
0d2848a5
MatejRojec Implement batch completions as a separate endpoint
09177f48
MatejRojec Fix None request_id validation in to_chat_completion_request
819052d8
MatejRojec Replace shell script example with Python script
cf107e3a
MatejRojec Move render_batch_chat_request into OpenAIServingChat
47b92b9d
MatejRojec Move render_batch_chat_request after render_chat_request
27706be1
MatejRojec Move batch chat completion logic into OpenAIServingChatBatch subclass
158ad59c
MatejRojec Move import json to top of test_chat_completion.py
d199a6f5
MatejRojec Move batch chat completion tests to dedicated test file
b92996a0
MatejRojec Add separate state for batch chat completion handler
92b996af
MatejRojec Fix mypy errors in OpenAIServingChatBatch
9bd033aa
MatejRojec Add BatchChatCompletionRequest to ChatLikeRequest type alias
f2600beb
MatejRojec force pushed from ed06ea40 to f2600beb 8 days ago
DarkLight1337
gemini-code-assist
gemini-code-assist commented on 2026-03-25
MatejRojec Add validator to BatchChatCompletionRequest
bc23224a
DarkLight1337
DarkLight1337 approved these changes on 2026-03-25
DarkLight1337 enabled auto-merge (squash) 8 days ago
MatejRojec Fix schemathesis failures for batch chat completion endpoint
eddd17bd
disabled auto-merge 8 days ago
Head branch was pushed to by a user without write access
MatejRojec Merge branch 'main' into feature/add-batch-requests-to-chat-completio…
434a529a
MatejRojec Fix ProcessorInputs import path in batch_serving.py
2dc00654
MatejRojec Fix EngineInput import in batch_serving.py
cc69644a
MatejRojec
DarkLight1337
DarkLight1337 approved these changes on 2026-03-26
DarkLight1337 merged 29080945 into main 7 days ago
MatejRojec deleted the feature/add-batch-requests-to-chat-completions-api branch 7 days ago

