Add parsing HTML to unstructured elements #3732
Add parsing HTML to unstructured elements
e070ee56
Add page number and category depth
ae08a433
Fix BR tags, wrong span names and additional empty tags
27142a6a
Adjust test for new parser settings
74b0933f
pip compile
04534b4a
Do not add Document element
97269691
Remove dashes from ids
adb99600
Add support for filename
64780526
Add docstring
df2fedd0
Add partition_html
fdf176a3
Merge remote-tracking branch 'origin/main' into parsing-html-to-elements
dd68b9cc
Update requirements
016e851a
Fix unit tests for 3.9 and rename param
26c8714c
Fix if statement
6168a5f5
Fix paths in tests
4bc246db
plutasnyy
merged
03a3ed8d
into main 1 year ago
plutasnyy
deleted the parsing-html-to-elements branch 1 year ago
Login to write a write a comment.
Login via GitHub