langchain
text-splitters: Fix regex separator merge bug in CharacterTextSplitter
#31137
Merged

text-splitters: Fix regex separator merge bug in CharacterTextSplitter #31137

suminnnnn
suminnnnn fix(text-splitters): prevent regex separators from being reinserted o…
41cd3761
suminnnnn add unit test to ensure regex separators aren’t reinserted on merge
34ef4e1e
vercel
dosubot dosubot added size:M
dosubot dosubot added Ɑ: text splitters
dosubot dosubot added bug
suminnnnn test: ensure newline at end of regex-merge test
52cdb871
ccurme
ccurme commented on 2025-05-08
suminnnnn fix(text-splitters): fix split_text merge logic to skip re-insertion …
fe6263c4
suminnnnn add parametrized cases for lookaround and literal separator chunking …
b29c31c5
suminnnnn fix line-length lint errors in split_text
87395df7
suminnnnn remove unnecessary indent
860ad02d
suminnnnn remove unnecessary indent
778a9c96
suminnnnn add a missing type annotation to function
80e9be00
suminnnnn
suminnnnn suminnnnn requested a review from ccurme ccurme 285 days ago
ccurme
ccurme approved these changes on 2025-05-10
dosubot dosubot added lgtm
ccurme ccurme merged 683da2c9 into master 283 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone