onnxruntime
8d78f96d
- [CUDA] Fuse add bias and transpose into one kernel in Attention (#12670)
Commit
3 years ago
[CUDA] Fuse add bias and transpose into one kernel in Attention (#12670)

* fuse add bias and transpose in attention
References
#12670 - [CUDA] Fuse add bias and transpose into one kernel in Attention
Author
tianleiwu
Parents
6246662b