vllm
[TPU] Support sliding window and logit soft capping in the paged attention kernel for TPU.
#15732
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
8
Changes
View On
GitHub
[TPU] Support sliding window and logit soft capping in the paged attention kernel for TPU.
#15732
robertgshaw2-redhat
merged 8 commits into
vllm-project:main
from
vanbasten23:xiowei/sw_logit_cap
mergify
added
ci/build
mergify
added
v1
mergify
added
tpu
mergify
added
needs-rebase
vanbasten23
force pushed
330 days ago
mergify
removed
needs-rebase
vanbasten23
marked this pull request as ready for review
330 days ago
vanbasten23
requested a review
from
DarkLight1337
330 days ago
vanbasten23
requested a review
from
robertgshaw2-redhat
330 days ago
vanbasten23
requested a review
from
simon-mo
330 days ago
vanbasten23
requested a review
from
WoosukKwon
330 days ago
vanbasten23
requested a review
from
njhill
330 days ago
vanbasten23
requested a review
from
ywang96
330 days ago
vanbasten23
requested a review
from
comaniac
330 days ago
vanbasten23
requested a review
from
alexm-redhat
330 days ago
yaochengji
commented on 2025-04-01
yaochengji
commented on 2025-04-01
vanbasten23
requested a review
from
yaochengji
329 days ago
yaochengji
approved these changes on 2025-04-02
NickLucche
approved these changes on 2025-04-02
vanbasten23
force pushed
328 days ago
bvrockwell
requested changes on 2025-04-02
vanbasten23
changed the title
[TPU] Add sliding window and logit soft capping support for TPU.
[TPU] Support sliding window and logit soft capping in the paged attention kernel for TPU.
328 days ago
bvrockwell
approved these changes on 2025-04-02
robertgshaw2-redhat
approved these changes on 2025-04-03
mergify
added
needs-rebase
robertgshaw2-redhat
enabled auto-merge (squash)
328 days ago
mergify
removed
needs-rebase
github-actions
added
ready
add sliding window and logit soft capping
ad5a0942
add test
e8188ab6
Add test for the sliding window and logit softcapping
967ea6df
fix the test faster the rebase.
b838faba
adding test for gemma
de1b347b
fix the ci
497be2cb
fix linter
7af49a16
fix comment
352fa122
disabled auto-merge
327 days ago
Head branch was pushed to by a user without write access
vanbasten23
force pushed
to
352fa122
327 days ago
robertgshaw2-redhat
merged
b6be6f8d
into main
327 days ago
Login to write a write a comment.
Login via GitHub
Reviewers
robertgshaw2-redhat
NickLucche
yaochengji
bvrockwell
DarkLight1337
simon-mo
WoosukKwon
njhill
ywang96
comaniac
alexm-redhat
Assignees
No one assigned
Labels
tpu
ready
ci/build
v1
Milestone
No milestone
Login to write a write a comment.
Login via GitHub