onnxruntime
4e670f7a
- Support larger hidden size in Attention Cuda kernel (#7002)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
4 years ago
Support larger hidden size in Attention Cuda kernel (#7002) * Support larger hidden size in Attention Cuda kernel * Update attention_transpose.cu * review comments * fix typo and add check in quantization * update readme
References
#7002 - Support larger hidden size in Attention Cuda kernel
Author
wangyems
Parents
27ac8820
Loading