onnxruntime
Extend Attention Bias Broadcast Support
#21710
Merged

Extend Attention Bias Broadcast Support #21710

tianleiwu merged 14 commits into main from tlwu/mha_attn_bias
tianleiwu
tianleiwu broadcast attention_bias dim 0 and 1
91284f07
tianleiwu broadcast attn bias in decoder masked mha
c76f2940
tianleiwu tianleiwu marked this pull request as draft 1 year ago
tianleiwu Add MHA tests
a8cebba1
tianleiwu rename relative_position_bias to attention_bias
c728b0be
tianleiwu fix build
2bff1881
github-advanced-security
github-advanced-security commented on 2024-08-15
tianleiwu update doc
58792dd5
tianleiwu Merge branch 'main' into tlwu/mha_attn_bias
801a86e3
tianleiwu format js
acfd6117
tianleiwu refactoring
1eb8c6b3
tianleiwu tianleiwu marked this pull request as ready for review 1 year ago
tianleiwu tianleiwu requested a review 1 year ago
tianleiwu tianleiwu requested a review from fs-eire fs-eire 1 year ago
tianleiwu tianleiwu requested a review from wangyems wangyems 1 year ago
tianleiwu tianleiwu requested a review from kunal-vaishnavi kunal-vaishnavi 1 year ago
tianleiwu tianleiwu requested a review from yufenglee yufenglee 1 year ago
wangyems
wangyems commented on 2024-08-15
tianleiwu refactoring cpu; add comments
6766b17e
tianleiwu refine softmax kernel
4984b455
tianleiwu benchmark mha with attention bias
a7e221b9
tianleiwu mark maybe_unused
1226c6d0
tianleiwu refine attn_bias_offset for dmmha with asummption of S=1
0c7f3952
tianleiwu tianleiwu requested a review from wangyems wangyems 1 year ago
kunal-vaishnavi
kunal-vaishnavi approved these changes on 2024-08-16
jchen351
jchen351 approved these changes on 2024-08-16
tianleiwu tianleiwu merged d79e3c57 into main 1 year ago
tianleiwu tianleiwu deleted the tlwu/mha_attn_bias branch 1 year ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone