onnxruntime c65e8920 - [CUDA] Fix performance bug in DecoderMaskedMultiheadAttention for BeamSearch (#17613)
Commit
2 years ago
[CUDA] Fix performance bug in DecoderMaskedMultiheadAttention for BeamSearch (#17613)
References
#17613 - [CUDA] Fix performance bug in DecoderMaskedMultiheadAttention for BeamSearch
Author
hariharans29
Parents
e6301eee