onnxruntime
DecoderMaskedMultiHeadAttention CPU kernel.
#22292
Merged

tianleiwu merged 14 commits into main from linmin/cpu_dmmha
mindest DecoderMaskedMultiHeadAttention CPU kernel.
10cd0279
mindest requested a review from kunal-vaishnavi 1 year ago
github-advanced-security commented on 2024-10-02
github-advanced-security commented on 2024-10-02
mindest Fix attention for no-beam case
f4dd1dc9
github-advanced-security commented on 2024-10-10
mindest Fix errors; update unit test cases
33fd0f10
github-advanced-security commented on 2024-10-10
mindest Fix some CI errors.
0af5deb1
tianleiwu commented on 2024-10-10
tianleiwu commented on 2024-10-10
mindest Fix: pick up local unstaged changes.
5e745902
tianleiwu commented on 2024-10-11
github-advanced-security commented on 2024-10-11
mindest Fix error; add broadcast for attn_bias; resolve comments
1c90af9f
github-advanced-security commented on 2024-10-11
mindest Format
ae766526
mindest marked this pull request as ready for review 1 year ago
mindest Update doc.
caddf065
mindest Add updated op kernel doc.
b2e35b46
mindest changed the title from "[WIP] DecoderMaskedMultiHeadAttention CPU kernel." to "DecoderMaskedMultiHeadAttention CPU kernel." 1 year ago
tianleiwu commented on 2024-10-11
tianleiwu commented on 2024-10-11
tianleiwu commented on 2024-10-11
tianleiwu commented on 2024-10-11
tianleiwu commented on 2024-10-11
tianleiwu commented on 2024-10-11
tianleiwu commented on 2024-10-11
mindest Resolve comments.
4f96ddb4
tianleiwu commented on 2024-10-11
tianleiwu commented on 2024-10-11
github-advanced-security commented on 2024-10-11
mindest Resolve more comments; fix warning
3ee8d538
mindest Fix CI, warnings.
f9f4ff36
tianleiwu dismissed these changes on 2024-10-12
kunal-vaishnavi commented on 2024-10-12
kunal-vaishnavi commented on 2024-10-12
github-advanced-security commented on 2024-10-12
mindest Fix warning; rename to output_qk
5b5e7913
mindest dismissed their stale review via 5b5e7913 1 year ago
mindest typo
67a653e3
tianleiwu approved these changes on 2024-10-12
tianleiwu merged 1fa219d7 into main 1 year ago
tianleiwu deleted the linmin/cpu_dmmha branch 1 year ago
