onnxruntime
optimization for whisper model with decoder masked multihead attention
#15827
Merged

Commits
  • Support whisper model to use MaskedDecoderMultiheadAttention
    zhanghuanrong committed 2 years ago
  • fix py lint warning
    zhanghuanrong committed 2 years ago
  • fix change fail unit test
    zhanghuanrong committed 2 years ago
  • Fix some PR feedback
    zhanghuanrong committed 2 years ago
  • Fix bug caused by remove optional attention mask
    zhanghuanrong committed 2 years ago
  • Fix bug during adding initial decoder ids
    zhanghuanrong committed 2 years ago
  • docs update
    zhanghuanrong committed 2 years ago