onnxruntime
762703e0 - Support output cross qk, dtw and more for whisper model (#17500)

Commit
2 years ago
Support output cross qk, dtw and more for whisper model (#17500) Support cross qk in beam search for whisper model and related features Make whisper exporting tools support cross qk and some related features, * extra_decoding_ids * no_speech_prob Implement DTW kernel, unfold tensor kernel with unit test Several fix related with multiple session running parallel, like: * guard multihead_attention, fused_fp16_runner_ * some memory allocation with stream awareness * add use_ep_level_unified_stream option
Author
Parents
Loading