Pipes attn_logits_soft_cap through multi_queries_paged_attention #8583
Pipes attn_logits_soft_cap through multi_queries_paged_attention
2b980606
fenghuizhang
marked this pull request as ready for review 251 days ago
Implements attn_logits_soft_cap and pass it through multi_queries_pag…
8106ad2d
Implements attn_logits_soft_cap and pass it through multi_queries_pag…
8802322e
Implements attn_logits_soft_cap and pass it through multi_queries_pag…
9e57ad42
lsy323
commented
on 2025-01-18
Implements attn_logits_soft_cap and pass it through multi_queries_pag…
18358764
Implements attn_logits_soft_cap and pass it through multi_queries_pag…
68cd431c
Implements attn_logits_soft_cap and pass it through multi_queries_pag…
491dbdb1
Implements attn_logits_soft_cap and pass it through multi_queries_pag…
b8660fed
Implements attn_logits_soft_cap and pass it through multi_queries_pag…
351de895
Implements attn_logits_soft_cap and pass it through multi_queries_pag…
2ce9e2fa
lsy323
approved these changes
on 2025-01-21
Implements attn_logits_soft_cap and pass it through multi_queries_pag…
19cf3a0f
Implements attn_logits_soft_cap and pass it through multi_queries_pag…
172f9cdd
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub