Fix Attention GQA implementation on CPU #25966
Fix Attention GQA implementation on CPU
28de5c54
add one more unit test
f407fd7a
negative infinity
7ff71da5
fix last unittest
6eb8ee93
fix merge conflicts
627fe07a
disable onnx test for attention
07745c42
xadupre
marked this pull request as ready for review 124 days ago
disable
542b5ab1
disable
08841deb
fix issues
db6df58d
disable
abb196e7
Merge branch 'main' of https://github.com/microsoft/onnxruntime into …
805fe693
disable test waiting for onnx
59ded0ce
disable
5f93b472
disable
5c99cf03
inlien
18b0787f
tianleiwu
approved these changes
on 2025-09-13
justinchuby
deleted the xadupre/attentioncpu branch 119 days ago
snnn
removed release:1.23.1
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub