llm-foundry
Default to using tokenizer eos and bos in convert_text_to_mds.py
#823
Merged

irenedea merged 2 commits into mosaicml:main from irenedea:tok
irenedea: default to using tokenizer eos and bos (754fe7fc)
irenedea: pyright fixes (50588ac0)
Skylion007 approved these changes on 2023-12-28
irenedea merged 5eb96e92 into main 2 years ago
