transformers
9c8979e3 - Word-level timestamps broken for short-form audio (#30325)

Commit

1 year ago

Word-level timestamps broken for short-form audio (#30325)

* force chunk_length_s in AutomaticSpeechRecognitionPipeline
* compute num_frames even when stride is None
* add slow tests
* fix test
* Update src/transformers/pipelines/automatic_speech_recognition.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update tests/pipelines/test_pipelines_automatic_speech_recognition.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* add input validation
* fixup
* small fix

---------

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
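
The commit message item "compute num_frames even when stride is None" can be illustrated with a small sketch. This is not the code from the commit: the helper name `num_frames_for`, the constants, and the `(chunk_len, left, right)` stride layout are assumptions made for illustration, based on Whisper-style feature extraction at 16 kHz with a hop of 160 samples. The idea is that a frame count is derived from the raw input length when no stride is available (short-form audio), so word-level timestamps still have a frame count to work from.

```python
from typing import Optional, Tuple

# Assumed Whisper-style constants; they are not stated in the commit message.
SAMPLING_RATE = 16_000  # samples per second
HOP_LENGTH = 160        # samples per feature frame


def num_frames_for(
    num_samples: int,
    stride: Optional[Tuple[int, int, int]] = None,
) -> int:
    """Hypothetical helper: number of feature frames covered by the audio.

    `stride` is assumed to be (chunk_len, left, right) in samples, as used
    when long audio is split into chunks. For short-form audio there is no
    chunking, so `stride` is None and the count falls back to the raw length.
    """
    if stride is not None:
        chunk_len, _left, _right = stride
        return chunk_len // HOP_LENGTH
    # Short-form path: still compute a frame count instead of skipping it.
    return num_samples // HOP_LENGTH


if __name__ == "__main__":
    three_seconds = 3 * SAMPLING_RATE
    print(num_frames_for(three_seconds))                 # short-form, no stride
    print(num_frames_for(three_seconds, (32_000, 0, 0)))  # chunked input
```

The fallback branch is the part that corresponds to the fix described in the message; the real pipeline may obtain the value differently, for example from the feature extractor.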

References

#30325 - Word-level timestamps broken for short-form audio

Author

kamilakesbi

Parents

4fda78c3
