onnxruntime
2e108741
- Enable CUDA tests for GQA attention tests
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
4 days ago
Enable CUDA tests for GQA attention tests Co-authored-by: titaiwangms <18010845+titaiwangms@users.noreply.github.com>
References
#27082 - Support group query attention in Attention(23) CUDA
Author
Copilot
Parents
0e7a632d
Loading