transformers
[`tokenizers`] Ensure that add_prefix_space is propagated to backend_tokenizer.pre_tokenizer
#35593
Merged

[`tokenizers`] Ensure that add_prefix_space is propagated to backend_tokenizer.pre_tokenizer #35593

tomaarsen
tomaarsen Ensure that add_prefix_space is propagated to backend_tokenizer.pre_t…
e8738e5d
tomaarsen tomaarsen requested a review from Rocketknight1 Rocketknight1 1 year ago
tomaarsen tomaarsen requested a review from ArthurZucker ArthurZucker 1 year ago
tomaarsen Simplify setting self.add_prefix_space, ensure pre_tok exists
8df9846a
tomaarsen Wrap in try-except to catch 'Custom PreTokenizer cannot be serialized'
aaffcb47
HuggingFaceDocBuilderDev
tomaarsen Propagate add_prefix_space in T5TokenizerFast to superclass
9888d790
ArthurZucker
ArthurZucker approved these changes on 2025-01-09
tomaarsen tomaarsen merged 32e0db8a into main 1 year ago
KoichiYasuoka
ArthurZucker

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone