onnxruntime
53d304d4 - optimize gated gru cuda kernel (#15525)

Commit
2 years ago
optimize gated gru cuda kernel (#15525) ### Description <!-- Describe your changes. --> Improvement with Tulrv6 on A100 ![image](https://user-images.githubusercontent.com/52801275/232602055-518726da-3a9a-4e2e-8def-2cd855c8225d.png) ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> --------- Co-authored-by: Ubuntu <wy@v100-2.0cdb2e52twzevn1i4fi45bylyg.jx.internal.cloudapp.net>
Author
Parents
Loading