transformers
Add Number Normalisation for SpeechT5
#25447
Merged

tanaymeh
tanaymeh add: NumberNormalizer works for integers, floats, common currencies, …
2211a405
HuggingFaceDocBuilderDev
sgugger
tanaymeh fix: renamed number normalizer class and added normalization to Speec…
05ef5427
tanaymeh fix: restyled with black and ruff, should pass code quality tests
00f88928
tanaymeh marked this pull request as ready for review 2 years ago
sanchit-gandhi
sanchit-gandhi commented on 2023-08-14
tanaymeh fix: moved normalization to tokenizer and other small changes to norm…
ed2e0aa6
tanaymeh add: test for normalization and changed the existing full tokenizer test
28c1240f
tanaymeh fix: tokenization tests now pass, made changes to existing tokenizati…
afa90898
tanaymeh
sanchit-gandhi
sanchit-gandhi approved these changes on 2023-08-15
sanchit-gandhi requested a review from ylacombe 2 years ago
sanchit-gandhi requested a review from ArthurZucker 2 years ago
tanaymeh
tanaymeh fix: changed default normalize setting to False, modified the tests a…
a49f17c8
ArthurZucker
ArthurZucker commented on 2023-08-16
tanaymeh fix: added support for comma separated numbers, tokenization on the f…
682bee8d
tanaymeh
ArthurZucker
ArthurZucker approved these changes on 2023-08-18
tanaymeh
sanchit-gandhi
sanchit-gandhi commented on 2023-08-21
ArthurZucker merged 182b8374 into main 2 years ago
tanaymeh deleted the add_normalization_speecht5 branch 2 years ago
ramkrishna757575
tanaymeh
ramkrishna757575
tanaymeh
ramkrishna757575
sanchit-gandhi
ramkrishna757575
