onnxruntime efdc783d - make SplitPackedQKV work on GQA
Commit
15 days ago
make SplitPackedQKV work on GQA
References
#26850 - [webgpu] Optimize AttentionPrepare
Author
qjia7
Committer
qjia7
Parents
0067a2d9