vllm
Remove unused kwargs from model definitions
#13555
Merged

Remove unused kwargs from model definitions #13555

hmellor
hmellor Remove `kv_cache` and `attn_metadata` from `Attention`
28c7f270
hmellor Remove `attn_metadata` from `MambaMixer` 1 & 2
1fe2b0db
hmellor Remove `kv_caches` and `attn_metadata` from `forward` call
153d253f
hmellor Remove `kv_caches` and `attn_metadata` from new model docs
eb30940a
hmellor Remove `kv_caches` and `attn_metadata` from model interface
7a757531
hmellor Remove args from a batch of models
7ddfd1fb
github-actions
mergify mergify added documentation
hmellor Remove args from another batch of models
f8794e9d
hmellor hmellor added ready
hmellor hmellor marked this pull request as ready for review 356 days ago
hmellor Remove `attn_metadata` from a couple more places
f81cad0e
hmellor Attempt fix HPU model runner
6beb1b14
DarkLight1337
hmellor Update CPU model runners
c7840700
hmellor Update V1 GPU model runner
72450ae1
hmellor hmellor requested a review from WoosukKwon WoosukKwon 356 days ago
hmellor hmellor requested a review from robertgshaw2-redhat robertgshaw2-redhat 356 days ago
hmellor hmellor requested a review from njhill njhill 356 days ago
hmellor hmellor requested a review from ywang96 ywang96 356 days ago
hmellor hmellor requested a review from comaniac comaniac 356 days ago
hmellor hmellor requested a review from alexm-redhat alexm-redhat 356 days ago
mergify mergify added v1
DarkLight1337 DarkLight1337 requested a review from youkaichao youkaichao 356 days ago
hmellor Update draft model runner
fdda9c6a
hmellor Update enc dec model runner
f9a1ee8a
hmellor Update remaining non-device model runners
b91538a9
mergify mergify added speculative-decoding
hmellor Allow `kv_caches` to be passed to `execute_model`
59f01be8
hmellor
hmellor Update XPU model runner
778910f5
hmellor Update V1 GPU model runner
c7cd8522
hmellor Update OpenVINO model runner
334d2b37
hmellor Update Neuron model runner
0735ed90
hmellor Add unused `kv_caches` arg to runners to limit scope of PR
5a8a73d0
hmellor Update TPU V0 and V1
3b9a35b6
hmellor Update HPU model runner
bb094d23
hmellor Make `kv_caches` optional in `HPUModelRunner.execute_model`
46d8fabd
hmellor Make linter happy
39ad6d44
hmellor Fix whisper test
164ee323
hmellor Add `kv_caches` back to remaining `*ModelRunner.execute_model()`
f6c8e2a0
hmellor Fix kernel tests
c917880d
hmellor hmellor requested a review from tlrmchlsmth tlrmchlsmth 355 days ago
hmellor Kick CI
6a296980
hmellor Merge branch 'main' into remove-unused-attn-args
cd1e8452
hmellor Fix missing import
f8b4d362
hmellor Fix call to `execute_model` in encoder decoder model runner
39742a3a
hmellor Fix call to `execute_model` in XPU model runner
cc087b05
hmellor Fix call to `execute_model` in multi-step model runner
6f703ba5
hmellor Fix V1 TPU model runner
d0ee4313
hmellor Fix multi-step model runner
29cff77f
hmellor Merge branch 'main' into remove-unused-attn-args
7e0c8083
comaniac
hmellor
comaniac
comaniac approved these changes on 2025-02-21
hmellor hmellor closed this 354 days ago
hmellor hmellor reopened this 354 days ago
hmellor
simon-mo
hmellor
hmellor Deprecate args in `Attention.forward` instead
5d84b992
hmellor
youkaichao
youkaichao
youkaichao approved these changes on 2025-02-22
youkaichao
youkaichao commented on 2025-02-22
hmellor Revert "Deprecate args in `Attention.forward` instead"
8925e30a
youkaichao
youkaichao commented on 2025-02-22
hmellor
heheda12345
heheda12345 approved these changes on 2025-02-23
hmellor Merge branch 'main' into remove-unused-attn-args
b7ec2d91
hmellor Fix `mllama` KV cache access
a775d1ce
simon-mo simon-mo merged cdc1fa12 into main 350 days ago
hmellor hmellor deleted the remove-unused-attn-args branch 350 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone