onnxruntime
762703e0 - Support output cross qk, dtw and more for whisper model (#17500)

Commit

2 years ago

Support output cross qk, dtw and more for whisper model (#17500) Support cross qk in beam search for whisper model and related features Make whisper exporting tools support cross qk and some related features, * extra_decoding_ids * no_speech_prob Implement DTW kernel, unfold tensor kernel with unit test Several fix related with multiple session running parallel, like: * guard multihead_attention, fused_fp16_runner_ * some memory allocation with stream awareness * add use_ep_level_unified_stream option

References

#17500 - Support output cross qk, dtw and more for whisper model

Author

zhanghuanrong

Parents

c695de91

onnxruntime 762703e0 - Support output cross qk, dtw and more for whisper model (#17500)

onnxruntime
762703e0 - Support output cross qk, dtw and more for whisper model (#17500)