vllm
b6be6f8d
- [TPU] Support sliding window and logit soft capping in the paged attention kernel for TPU. (#15732)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
326 days ago
[TPU] Support sliding window and logit soft capping in the paged attention kernel for TPU. (#15732) Signed-off-by: Xiongfei Wei <isaacwxf23@gmail.com>
References
#15732 - [TPU] Support sliding window and logit soft capping in the paged attention kernel for TPU.
Author
vanbasten23
Parents
03a70eac
Loading