[Whisper] Fix whisper tokenizer #34537
handle single timestamp ending
751f88ab
include last timestamp token
000dccdb
handle single timestamp ending
70c8aaca
Merge branch 'huggingface:main' into fix-whispertokenizer
3f9f7eab
eustlb
changed the title Fix whispertokenizer [Whisper] Fix whisper tokenizer 1 year ago
eustlb
changed the title [Whisper] Fix whisper tokenizer [WPI] [Whisper] Fix whisper tokenizer 1 year ago
eustlb
changed the title [WPI] [Whisper] Fix whisper tokenizer [WIP] [Whisper] Fix whisper tokenizer 1 year ago
avoid floating points arithm limitations
e7532066
ensure float64 operations
c53fb2ce
new test
185fb555
eustlb
commented
on 2024-10-31
make fixup
429904c0
make copies
1c722443
Merge branch 'main' into fix-whispertokenizer
ad4f3553
eustlb
marked this pull request as ready for review 1 year ago
eustlb
changed the title [WIP] [Whisper] Fix whisper tokenizer [Whisper] Fix whisper tokenizer 1 year ago
Merge branch 'main' into fix-whispertokenizer
09af9de0
handle edge case double tokens ending with different tokens
7d6f9b47
eustlb
force pushed
to
7d6f9b47
1 year ago
handle single timestamp ending
937cd2a1
make fixup
9b1d51ed
handle conditioning on prev segments
a3cbe9f4
fix
7c0da36b
ylacombe
approved these changes
on 2024-11-21
Update src/transformers/models/whisper/generation_whisper.py
4a21249a
Merge branch 'main' into fix-whispertokenizer
5b195a88
[run-slow] whisper
e8f2f690
Merge branch 'main' into fix-whispertokenizer
d71d40a2
Merge branch 'main' into fix-whispertokenizer
961d5f69
Merge branch 'main' into fix-whispertokenizer
e04aa926
Merge branch 'main' into fix-whispertokenizer
69abfe07
Merge branch 'main' into fix-whispertokenizer
16b1ecb5
don't call item() to avoid unnecessary sync
5fba3e0f
Merge branch 'main' into fix-whispertokenizer
9624ce9f
fix
88587bb5
eustlb
merged
54aae121
into main 1 year ago
alubbe
commented
on 2024-12-10
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub