onnxruntime
042ff320
- Fix code review issues: use v_head_size and parameters.softcap
Commit
6 days ago
Fix code review issues: use v_head_size and parameters.softcap

Co-authored-by: titaiwangms <18010845+titaiwangms@users.noreply.github.com>
References
#27082 - Support group query attention in Attention(23) CUDA
Author
Copilot
Parents
53c333fd