vllm
Commit 80fcc3ed
- [Kernel] Pipe attn_logits_soft_cap through paged attention TPU kernels (#12482)
Commit
166 days ago
[Kernel] Pipe attn_logits_soft_cap through paged attention TPU kernels (#12482)
Signed-off-by: Fenghui Zhang <fhzhang@google.com>
References
#12482 - [Kernel] Pipe attn_logits_soft_cap through paged attention TPU kernels
Author
fenghuizhang
Parents
c386c43c
Files (1)
vllm/attention/backends/pallas.py
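The commit threads an `attn_logits_soft_cap` parameter through to the TPU paged-attention kernels. As context, logits soft-capping conventionally means squashing attention scores with a scaled tanh, `cap * tanh(logits / cap)`, so their magnitude never exceeds the cap. The sketch below only illustrates that math with a hypothetical helper function; it is not the actual Pallas kernel code from `pallas.py`.

```python
import numpy as np
from typing import Optional


def soft_cap_logits(logits: np.ndarray,
                    attn_logits_soft_cap: Optional[float]) -> np.ndarray:
    """Illustrative tanh soft-capping of attention logits.

    With a cap C, each logit x is mapped to C * tanh(x / C), which is
    approximately x for |x| << C and saturates toward +/-C for large |x|.
    A cap of None disables capping (the usual convention for an optional
    knob like this).
    """
    if attn_logits_soft_cap is None:
        return logits  # capping disabled: pass logits through unchanged
    return attn_logits_soft_cap * np.tanh(logits / attn_logits_soft_cap)
```

For example, with a cap of 30.0, a huge raw score such as 1e6 is squashed to just under 30, while small scores near zero are left essentially unchanged.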