onnxruntime
d530b290
- Fix Attention GQA implementation on CPU (#25966)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
106 days ago
Fix Attention GQA implementation on CPU (#25966) ### Description Attention on CPU is following ONNX specifications. This change replicates the changes introduced by https://github.com/onnx/onnx/pull/7274.
References
#25966 - Fix Attention GQA implementation on CPU
Author
xadupre
Parents
21325309
Loading