xla
Fix an issue when piping attn_logits_soft_cap through in vllm.
#8600
Merged

Fix an issue when piping attn_logits_soft_cap through in vllm. #8600

lsy323 merged 14 commits into pytorch:master from fenghuizhang:master
fenghuizhang
fenghuizhang Pipes attn_logits_soft_cap through multi_queries_paged_attention
2b980606
fenghuizhang Implements attn_logits_soft_cap and pass it through multi_queries_pag…
8106ad2d
fenghuizhang Implements attn_logits_soft_cap and pass it through multi_queries_pag…
8802322e
fenghuizhang Implements attn_logits_soft_cap and pass it through multi_queries_pag…
9e57ad42
fenghuizhang Implements attn_logits_soft_cap and pass it through multi_queries_pag…
18358764
fenghuizhang Implements attn_logits_soft_cap and pass it through multi_queries_pag…
68cd431c
fenghuizhang Implements attn_logits_soft_cap and pass it through multi_queries_pag…
491dbdb1
fenghuizhang Implements attn_logits_soft_cap and pass it through multi_queries_pag…
b8660fed
fenghuizhang Implements attn_logits_soft_cap and pass it through multi_queries_pag…
351de895
fenghuizhang Implements attn_logits_soft_cap and pass it through multi_queries_pag…
2ce9e2fa
fenghuizhang Implements attn_logits_soft_cap and pass it through multi_queries_pag…
19cf3a0f
fenghuizhang Implements attn_logits_soft_cap and pass it through multi_queries_pag…
172f9cdd
fenghuizhang Fix the signature of paged_attention by marking attn_logits_soft_cap …
633792cf
fenghuizhang Merge branch 'pytorch:master' into master
28df218e
fenghuizhang fenghuizhang marked this pull request as ready for review 285 days ago
lsy323
lsy323 approved these changes on 2025-01-22
lsy323 lsy323 merged 5b877beb into master 285 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone