vllm
b6be6f8d - [TPU] Support sliding window and logit soft capping in the paged attention kernel for TPU. (#15732)

Commit
326 days ago
[TPU] Support sliding window and logit soft capping in the paged attention kernel for TPU. (#15732) Signed-off-by: Xiongfei Wei <isaacwxf23@gmail.com>
Author
Parents
Loading