onnxruntime
optimization for whisper model with decoder masked multihead attention
#15827
Merged

Loading