vllm
offload prompt_embeds decode in render_prompts_async to avoid blocking
#43792
Merged

offload prompt_embeds decode in render_prompts_async to avoid blocking #43792

gagandhakrey
gagandhakrey offload prompt_embeds decode in render_prompts_async to avoid blockin…
48528e20
gagandhakrey gagandhakrey requested a review from DarkLight1337 DarkLight1337 3 days ago
gagandhakrey gagandhakrey requested a review from njhill njhill 3 days ago
github-actions
gagandhakrey Merge branch 'main' into perf/render-prompts-async-offload
e24a2816
gagandhakrey gagandhakrey changed the title offload prompt_embeds decode in render_prompts_async to avoid blockin… offload prompt_embeds decode in render_prompts_async to avoid blocking 3 days ago
qthequartermasterman
qthequartermasterman approved these changes on 2026-05-29
qthequartermasterman
DarkLight1337 DarkLight1337 added verified
DarkLight1337
DarkLight1337 approved these changes on 2026-05-29
DarkLight1337 DarkLight1337 enabled auto-merge (squash) 1 day ago
github-actions github-actions added ready
DarkLight1337 Merge branch 'main' into perf/render-prompts-async-offload
57e4318c
DarkLight1337 DarkLight1337 merged 1e2ce5d1 into main 1 day ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone