transformers
Add split special tokens
#30772
Merged

itazap merged 21 commits into main from add_split_special_tokens
itazap requested a review from ArthurZucker 1 year ago
ArthurZucker commented on 2024-05-13
HuggingFaceDocBuilderDev
itazap commented on 2024-05-14
itazap requested a review from ArthurZucker 1 year ago
ArthurZucker force pushed 1 year ago
ArthurZucker force pushed 1 year ago
ArthurZucker force pushed to 9a65f9f7 1 year ago
ArthurZucker commented on 2024-05-14
ArthurZucker force pushed to 844202b6 1 year ago
itazap marked this pull request as ready for review 1 year ago
itazap requested a review from ArthurZucker 1 year ago
ArthurZucker commented on 2024-05-15
itazap requested a review from ArthurZucker 1 year ago
ArthurZucker approved these changes on 2024-05-24
itazap closed this 1 year ago
itazap force pushed to 42d8dd87 1 year ago
itazap reopened this 1 year ago
ArthurZucker seems like `split_special_tokens` is used here
56fd608d
split special token
fbb144cb
add new line at end of file
abeaeb4e
moving split special token test to common tests
99580a9d
added assertions
a7ebe7da
ArthurZucker test
224aae47
ArthurZucker fixup
80adc63c
itazap add co-author
6dbe8789
passing rest of args to gptsan_japanese, fixing tests
4000f635
removing direct comparison of fast and slow models
aec5f710
adding test support for UDOP and LayoutXLM
88b12aa6
ruff fix
fd5d1fef
readd check if slow tokenizer
bb64b8d4
modify test to handle bos tokens
24473ace
removing commented function
fb321f6b
trigger build
fd357bd6
applying review feedback - updated docstrings, var names, and simplif…
34cdba68
ruff fixes
29af7206
itazap Update tests/test_tokenization_common.py
80b4e77b
applying feedback, comments
f5bf1093
ArthurZucker force pushed to f5bf1093 1 year ago
shutil temp directory fix
2ce75694
itazap merged deba7655 into main 1 year ago
itazap deleted the add_split_special_tokens branch 1 year ago
itazap restored the head branch 1 year ago
itazap deleted the add_split_special_tokens branch 1 year ago
