onnxruntime
f661c186
- Fix attention perf regression (#8682)
Commit
4 years ago
Fix attention perf regression (#8682)

* undo change in attention cpu
* fix perf regression
* disable persistent softmax by default
References
#8682 - Fix attention perf regression
Author
tianleiwu
Parents
c2433524