vllm
24b0205f
- [V1][Frontend] Coalesce bunched `RequestOutput`s (#12298)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Hide Comment Changes
Previous Change (CTRL+↑)
Next Change (CTRL+↓)
Expand Context Lines
Collapse Context Lines
Hide Minimap (CTRL+M)
Commit
137 days ago
[V1][Frontend] Coalesce bunched `RequestOutput`s (#12298) Signed-off-by: Nick Hill <nhill@redhat.com> Co-authored-by: Robert Shaw <rshaw@neuralmagic.com>
References
#12298 - [V1][Frontend] Coalesce bunched `RequestOutput`s
Author
njhill
Parents
c5cffcd0
Files
3
tests/v1/engine
test_async_llm.py
vllm
outputs.py
v1/engine
async_llm.py
Loading