onnxruntime
d530b290 - Fix Attention GQA implementation on CPU (#25966)

Commit
106 days ago
Fix Attention GQA implementation on CPU (#25966) ### Description Attention on CPU is following ONNX specifications. This change replicates the changes introduced by https://github.com/onnx/onnx/pull/7274.
Author
Parents
Loading