vllm
[Core] [Bugfix] Add Input Embeddings
#15428
Merged

[Core] [Bugfix] Add Input Embeddings #15428

qthequartermasterman
qthequartermasterman qthequartermasterman requested a review from zhuohan123 zhuohan123 335 days ago
qthequartermasterman qthequartermasterman requested a review from youkaichao youkaichao 335 days ago
qthequartermasterman qthequartermasterman requested a review from alexm-redhat alexm-redhat 335 days ago
qthequartermasterman qthequartermasterman requested a review from comaniac comaniac 335 days ago
qthequartermasterman qthequartermasterman requested a review from njhill njhill 335 days ago
github-actions
mergify mergify added frontend
qthequartermasterman qthequartermasterman force pushed 335 days ago
DarkLight1337 DarkLight1337 assigned DarkLight1337 DarkLight1337 335 days ago
liangwythu
DarkLight1337
DarkLight1337 commented on 2025-03-25
DarkLight1337
qthequartermasterman qthequartermasterman requested a review from ywang96 ywang96 334 days ago
qthequartermasterman
qthequartermasterman commented on 2025-03-25
qthequartermasterman qthequartermasterman force pushed 334 days ago
DarkLight1337 DarkLight1337 closed this 334 days ago
DarkLight1337 DarkLight1337 reopened this 334 days ago
DarkLight1337
DarkLight1337
DarkLight1337 commented on 2025-03-26
yukang2017
lzl-mt
lzl-mt commented on 2025-03-27
lzl-mt
mergify
mergify mergify added needs-rebase
(vllm) add input embedding
cef68948
Bryce1010 improve embedding input
c51d8fb5
Bryce1010 (vllm) fix import error
9564b402
Bryce1010 (vllm) fix pre commit error
c60298a4
qthequartermasterman apply ruff and isort fixes
0c24a823
qthequartermasterman apply ruff and isort fixes
403a1655
qthequartermasterman styling
b1ac0721
qthequartermasterman fix missing imports from rebase
0390c334
qthequartermasterman typing fixes
0ca4daec
qthequartermasterman type fix
35320fe4
qthequartermasterman type fix
0a77630c
qthequartermasterman remove unnecessary changes
11b6c02a
qthequartermasterman remove unnecessary changes
cb92a3dd
qthequartermasterman re-add deleted whitespace
375bd5b4
qthequartermasterman Include unit tests from #6869.
c9d80248
qthequartermasterman remove unrelated qwen2 changes
a64e6274
qthequartermasterman guard clause around fully consumed prompt embeds to avoid returning e…
6ab349eb
qthequartermasterman use v0 for prompt embeds model runner tests
26c87840
qthequartermasterman qthequartermasterman force pushed to 26c87840 326 days ago
mergify mergify removed needs-rebase
qthequartermasterman fix batching of input embeddings
b71a13c2
qthequartermasterman
qthequartermasterman style formatting
4aa9ade4
qthequartermasterman remove incorrect overload
e2c4c26d
qthequartermasterman remove incorrect overload
26d108ac
lzl-mt
lzl-mt
yukang2017
lzl-mt
qthequartermasterman
DarkLight1337
DarkLight1337 commented on 2025-04-04
DarkLight1337
qthequartermasterman Update representations
af204355
qthequartermasterman qthequartermasterman force pushed to af204355 325 days ago
qthequartermasterman remove unrelated changes to docs
25aaf3fe
qthequartermasterman remove unrelated typing change
bc058601
qthequartermasterman fix missing syntax
b55800d4
qthequartermasterman do not schedule prompt embeds and non-prompt embeds in the same batch
be42a17e
qthequartermasterman fix style linelength
c8fcfe41
SnowCharmQ
DarkLight1337
DarkLight1337
qthequartermasterman Merge branch 'main' into feature/vllm/add-input-embedding
b21688fb
Aaronhuang-778
SnowCharmQ
SnowCharmQ
SnowCharmQ commented on 2025-04-08
qthequartermasterman
lzl-mt
qthequartermasterman
SnowCharmQ
Aaronhuang-778
Aaronhuang-778 commented on 2025-04-09
lzl-mt
Aaronhuang-778
lzl-mt
qthequartermasterman propogate embeddings for sampled output tokens for decoding
1e359ae5
mergify mergify added speculative-decoding
qthequartermasterman fix type check
59fbe702
qthequartermasterman do not schedule decode sequence groups with batches containing both p…
c152a3ac
qthequartermasterman Merge branch 'main' into feature/vllm/add-input-embedding
42ad8007
qthequartermasterman fix type check
e7ab2a22
qthequartermasterman add default value to optional parameter
911adbe1
lzl-mt
qthequartermasterman remove unused comments
82d923d6
qthequartermasterman qthequartermasterman force pushed to 82d923d6 314 days ago
qthequartermasterman properly pass in placeholder token ids when testing prompt embeds
c9514792
qthequartermasterman do not test mixed token_ids/prompt_embeds batches in the model_runner
01e1a6eb
qthequartermasterman refactor cuda_prepare_decode test
193ad5cb
qthequartermasterman use correct expected input embeds length for prepare_decode_cuda_grap…
74bd9f46
qthequartermasterman add scheduler test to ensure prompt embeds and prompt tokens are not …
d949f1b0
qthequartermasterman
qthequartermasterman support inputs_embeds in compiled mode
62bbc881
qthequartermasterman fix typing in test
1d1ae4ba
qthequartermasterman use corrector operator precedence for handling empty strings
1914676b
qthequartermasterman only test decoder models with input embeds in v0 backend
70198f62
qthequartermasterman Merge branch 'vllm-project:main' into feature/vllm/add-input-embedding
934ceae6
qthequartermasterman
DarkLight1337
DarkLight1337 commented on 2025-04-17
DarkLight1337
DarkLight1337 commented on 2025-04-17
DarkLight1337
DarkLight1337 commented on 2025-04-17
DarkLight1337
DarkLight1337 commented on 2025-04-17
DarkLight1337
DarkLight1337 commented on 2025-04-17
DarkLight1337
DarkLight1337 commented on 2025-04-17
DarkLight1337
DarkLight1337 commented on 2025-04-17
DarkLight1337
DarkLight1337 commented on 2025-04-17
DarkLight1337
DarkLight1337 commented on 2025-04-17
qthequartermasterman adjust type hints for modelinputforgpubuilder.build
5595b452
qthequartermasterman simplify conditional logic
3343d3e0
qthequartermasterman simplify compilation conditional logic
5010ea02
qthequartermasterman refactor decoder only language model tests to reduce number of times …
2075e538
qthequartermasterman break up multiple assignments for readability
9a4fb3c0
qthequartermasterman update type hints in scheduler
8ad40915
qthequartermasterman clear existing lists instead of instantiating new ones
9055daf7
qthequartermasterman preprocess tensors to handle batched/misshaped prompt embeds to avoid…
9a57acaf
qthequartermasterman use seperate Embedsprompt class for preprocessing inputs embeddings
bbfb0f0e
qthequartermasterman fix typing
933e5672
qthequartermasterman fix type errors
4e0d12f6
qthequartermasterman Merge branch 'vllm-project:main' into feature/vllm/add-input-embedding
164aeb5e
qthequartermasterman fix mistaken type change
9e6909e6
qthequartermasterman add missing type hint
90b950ad
DarkLight1337
DarkLight1337 commented on 2025-04-19
qthequartermasterman add spaces for style
01d83f4d
qthequartermasterman seperate EmbedsInputs from TokenInputs and embeds_inputs from token_i…
69854521
qthequartermasterman fix docstrings for EmbedsInputs
e916551f
qthequartermasterman fix typing for token_type_ids
69f87250
qthequartermasterman fix typing for embeds_tokens in InputRegistry and InputsAdapter
9c2c89fb
DarkLight1337
DarkLight1337 commented on 2025-04-20
qthequartermasterman remove prompts and prompt_token_ids from EmbedsPrompts
499dc6a7
DarkLight1337
qthequartermasterman
DarkLight1337
KyleMylonakisProtopia
codexceed
mergify
mergify mergify added needs-rebase
wangxiyuan
DarkLight1337
DarkLight1337
leonardtang
qthequartermasterman
DarkLight1337
qthequartermasterman Merge branch 'main' into feature/vllm/add-input-embedding
20668ca8
mergify mergify removed needs-rebase
qthequartermasterman fight mypy to get correct typing for not embeds prompts
6712ba67
qthequartermasterman
DarkLight1337
qthequartermasterman remove incorrect call to embeds_inputs
740b290b
qthequartermasterman wrestle with mypy and typeddict type narrowing
8f9bd513
qthequartermasterman wrestle with mypy and typeddict type narrowing
b8d36c69
DarkLight1337
DarkLight1337 commented on 2025-04-29
DarkLight1337
DarkLight1337 approved these changes on 2025-04-29
DarkLight1337 DarkLight1337 enabled auto-merge (squash) 300 days ago
github-actions github-actions added ready
qthequartermasterman
DarkLight1337
qthequartermasterman support indexing graph runners that with inputs_embeds
b764c193
qthequartermasterman
disabled auto-merge 299 days ago
Head branch was pushed to by a user without write access
mergify
mergify mergify added needs-rebase
qthequartermasterman Merge branch 'main' into feature/vllm/add-input-embedding
cb6ff22f
qthequartermasterman support encoder decoder models with inputs_embeds
85642d0c
qthequartermasterman simplify redundant ternary statement
b226fd6b
qthequartermasterman explicitly remove support for inputs embeds with speculative decoding…
b738d3fd
qthequartermasterman fix occasional device mismatch errors when appending output tokens to…
2340119d
mergify mergify removed needs-rebase
qthequartermasterman Merge branch 'main' into feature/vllm/add-input-embedding
92b3264c
qthequartermasterman
qthequartermasterman Merge branch 'main' into feature/vllm/add-input-embedding
b9271c15
DarkLight1337
DarkLight1337 Fix a typo
28b0983b
vllm-bot vllm-bot merged cc2a77d7 into main 296 days ago
KyleMylonakisProtopia
qthequartermasterman
njhill
njhill njhill added v0
DarkLight1337
gshtras
gshtras commented on 2025-05-06

Login to write a write a comment.

Login via GitHub

Assignees
Labels
Milestone