unstructured
Klaijan/auto paragraph grouper
#994
Merged

Klaijan/auto paragraph grouper #994

Klaijan merged 27 commits into main from klaijan/auto_paragraph_grouper
Klaijan
add auto_paragraph_grouper. add line break pattern.
acc2303d
combine group_broken_paragraph and blank_line_grouper function
92bb0f0a
fix make check errors
3889628b
fix make check errors
def7d6e3
fix make check errors
c9cf970d
fix make check errors
7810a23f
Klaijan Merge branch 'main' into klaijan/auto_paragraph_grouper
2d59008a
run make tidy to fix errors
593a59d8
tidy core.py and text.py
70171fc1
fix blank-line breaker to extends the result and replace new line wit…
7a04c78e
fix function name typo
142378c7
call group_broken_paragraphs for blank_line_grouper
8fb93b7a
cragwolfe
cragwolfe commented on 2023-08-02
edit function name from one_line_grouper to new_line_grouper for cons…
b07e9718
cragwolfe
cragwolfe approved these changes on 2023-08-03
cragwolfe
cragwolfe commented on 2023-08-04
Klaijan Merge branch 'main' into klaijan/auto_paragraph_grouper
2aedd795
Klaijan edit threshold from 0.5 to 0.1
fffc8c2c
Klaijan edit threshold from 0.5 to 0.1
1500e4b7
Klaijan Merge branch 'klaijan/auto_paragraph_grouper' of https://github.com/U…
9df86b41
Klaijan Revert "call group_broken_paragraphs for blank_line_grouper"
8b5f7c70
Klaijan revert to commit 8fb93b7 and change threshold from 0.5 to 0.1
55c6568c
Klaijan Merge branch 'main' into klaijan/auto_paragraph_grouper
bf3a6a66
Klaijan Merge branch 'main' into klaijan/auto_paragraph_grouper
b85d052a
Klaijan Merge branch 'main' into klaijan/auto_paragraph_grouper
d22dedda
Klaijan edit test_text assertion. remove all BULLETS_PATTERN.
4f13c497
ryannikolaidis Update ingest test fixtures (#1052)
37eb9a94
ahmetmeleq
Klaijan edit test case in test_xml_partition
0723e8d7
Klaijan Merge branch 'klaijan/auto_paragraph_grouper' of https://github.com/U…
3d5482e5
Klaijan update assertion on test_auto
bfe92607
Klaijan Klaijan merged ad386af8 into main 2 years ago
Klaijan Klaijan deleted the klaijan/auto_paragraph_grouper branch 2 years ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone