onnxruntime
optimization for whisper model with decoder masked multihead attention
#15827

Merged

Commits

Support whisper model to use MaskedDecoderMultiheadAttention

zhanghuanrong committed 2 years ago
fix py lint warning

zhanghuanrong committed 2 years ago
fix change fail unit test

zhanghuanrong committed 2 years ago
Fix som pr feedback

zhanghuanrong committed 2 years ago
Fix bug caused by remove optional attention mask

zhanghuanrong committed 2 years ago
Fix bug during adding initial decoder ids

zhanghuanrong committed 2 years ago
docs update

zhanghuanrong committed 2 years ago