onnxruntime
optimization for whisper model with decoder masked multihead attention
#15827
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
7
Changes
View On
GitHub
Commits
Support whisper model to use MaskedDecoderMultiheadAttention
zhanghuanrong
committed
2 years ago
fix py lint warning
zhanghuanrong
committed
2 years ago
fix change fail unit test
zhanghuanrong
committed
2 years ago
Fix som pr feedback
zhanghuanrong
committed
2 years ago
Fix bug caused by remove optional attention mask
zhanghuanrong
committed
2 years ago
Fix bug during adding initial decoder ids
zhanghuanrong
committed
2 years ago
docs update
zhanghuanrong
committed
2 years ago
Loading