onnxruntime
Attention Operator (CPU)
#25156
Merged

xadupre merged 56 commits into main from xadupre/attention
xadupre Skeleton for Attention Operator (CPU)
39ff7993
github-actions commented on 2025-06-24
github-advanced-security commented on 2025-06-24
xadupre Update onnxruntime/core/providers/cpu/llm/attention.cc
ec9348ad
xadupre Merge branch 'main' of https://github.com/microsoft/onnxruntime into …
7ea6cbc1
xadupre first draft for attention
2174ab61
xadupre First working attention scenario
d03321ca
xadupre fix build issues
a6d803b3
xadupre fix with mask
72d8213b
xadupre add mask
ad946b7b
xadupre new addition
deaa572a
titaiwangms requested a review from titaiwangms 195 days ago
xadupre add test on causal
00fa987c
xadupre improve kernel
929bd731
xadupre fix 3D
a8d19b99
xadupre Merge branch 'main' of https://github.com/microsoft/onnxruntime into …
da457b95
xadupre add softcap
d6c6c46c
xadupre fix present_value
8c2a53e0
xadupre fix gqa
bdbb773b
xadupre Fix gda past, present
88a04a34
xadupre fix qkmode
3d07fecf
xadupre add more unit test
099ba9a7
xadupre add ort_enforce
d45fd010
xadupre more enforce
08b76dac
xadupre changed the title from "[DRAFT] Attention Operator (CPU)" to "Attention Operator (CPU)" 187 days ago
xadupre marked this pull request as ready for review 187 days ago
titaiwangms requested a review from tianleiwu 187 days ago
justinchuby requested a review from justinchuby 187 days ago
justinchuby requested a review from copilot-pull-request-reviewer 187 days ago
justinchuby added release:1.23.0
copilot-pull-request-reviewer commented on 2025-07-08
gramalingam commented on 2025-07-08
gramalingam commented on 2025-07-08
gramalingam commented on 2025-07-08
gramalingam commented on 2025-07-08
gramalingam commented on 2025-07-08
gramalingam commented on 2025-07-08
tianleiwu commented on 2025-07-08
gramalingam commented on 2025-07-08
tianleiwu commented on 2025-07-08
gramalingam commented on 2025-07-08
tianleiwu commented on 2025-07-08
titaiwangms commented on 2025-07-08
titaiwangms commented on 2025-07-09
xadupre Update onnxruntime/core/providers/cpu/llm/attention.cc
f3ccc4f8
xadupre Update onnxruntime/core/providers/cpu/llm/attention.cc
7567629d
xadupre address PR comments
bb53f0fb
justinchuby requested a review from copilot-pull-request-reviewer 186 days ago
copilot-pull-request-reviewer commented on 2025-07-09
xadupre improve for 2D mask
6457762c
xadupre comment
e76c56b1
xadupre refactor
ac719af3
titaiwangms requested a review from kunal-vaishnavi 185 days ago
xadupre useless variabel
6f45bb57
xadupre Merge branch 'main' of https://github.com/microsoft/onnxruntime into …
66109dbb
xadupre fix parameters
a58f41e6
kunal-vaishnavi commented on 2025-07-11
kunal-vaishnavi commented on 2025-07-11
xadupre Update onnxruntime/core/providers/cpu/llm/attention_helper.h
3d9d5431
xadupre Merge branch 'main' of https://github.com/microsoft/onnxruntime into …
47c32044
github-actions commented on 2025-07-15
github-advanced-security commented on 2025-07-15
xadupre Update onnxruntime/core/providers/cpu/llm/attention_helper.h
e1e9ec57
xadupre Merge branch 'xadupre/attention' of https://github.com/microsoft/onnx…
c1b4e120
xadupre add support for float16
2dfb5d4d
xadupre enable gemm for float16 in Attention
34262fb6
xadupre support for float16
e31b42d0
xadupre cast
eb6eef07
titaiwangms requested a review from tianleiwu 180 days ago
titaiwangms requested a review from kunal-vaishnavi 180 days ago
tianleiwu commented on 2025-07-15
tianleiwu commented on 2025-07-15
tianleiwu commented on 2025-07-15
tianleiwu commented on 2025-07-15
tianleiwu commented on 2025-07-15
tianleiwu commented on 2025-07-15
xadupre pr comments
bcc7c8a4
tianleiwu commented on 2025-07-16
tianleiwu commented on 2025-07-16
tianleiwu commented on 2025-07-16
tianleiwu commented on 2025-07-16
tianleiwu commented on 2025-07-16
tianleiwu commented on 2025-07-16
xadupre fix input transposition
5fd5ac89
xadupre fix gemm fp16
5c89fe2b
xadupre fix issues
14786ad7
xadupre fix an error message
1a2f5fc2
xadupre enables removed tes
876c8eff
xadupre fix fp16 implementation
6b393ba1
xadupre Merge branch 'main' of https://github.com/microsoft/onnxruntime into …
94466597
xadupre fix missing cast
5b826d30
xadupre disable one warning
1035cf8c
xadupre disable attention 3d tests
1456623a
xadupre update test cases
351c3036
xadupre remove unnecessary comment
2b936168
xadupre Merge branch 'main' of https://github.com/microsoft/onnxruntime into …
3bfc049b
xadupre disable two tests
c7eb814c
xadupre Merge branch 'main' of https://github.com/microsoft/onnxruntime into …
9cf5712e
justinchuby dismissed these changes on 2025-07-22
kunal-vaishnavi commented on 2025-07-22
kunal-vaishnavi commented on 2025-07-22
xadupre Update onnxruntime/core/providers/cpu/llm/attention.cc
45f5556b
xadupre dismissed their stale review via 45f5556b 173 days ago
xadupre requested a review from tianleiwu 173 days ago
xadupre Merge branch 'main' of https://github.com/microsoft/onnxruntime into …
753dbbaa
titaiwangms assigned titaiwangms 171 days ago
justinchuby approved these changes on 2025-07-24
titaiwangms approved these changes on 2025-07-24
tianleiwu approved these changes on 2025-07-25
xadupre merged c3499d78 into main 170 days ago
xadupre deleted the xadupre/attention branch 170 days ago
snnn removed release:1.23.0