vllm
24b0205f - [V1][Frontend] Coalesce bunched `RequestOutput`s (#12298)

Comment changes are shownComment changes are hidden
Commit
137 days ago
[V1][Frontend] Coalesce bunched `RequestOutput`s (#12298) Signed-off-by: Nick Hill <nhill@redhat.com> Co-authored-by: Robert Shaw <rshaw@neuralmagic.com>
Author
Parents
  • tests/v1/engine
    • File
      test_async_llm.py
  • vllm
    • File
      outputs.py
    • v1/engine
      • File
        async_llm.py