unstructured
fix doctype parsing error
#3811
Merged

tbs17 merged 15 commits into main from tshen/fix-doctype-parsing-error
tbs17 bump version to 0.16.3
6725ccc8
tbs17 merge main
8676b7de
tbs17 fix doctype parse error
18165661
tbs17 fix all the doctypes across classes in evaluate.py
aeab1b75
tbs17 updating test cases for the fix
e8450136
tbs17 remove large test pdf files
f5e6af6f
tbs17 remove the stem for other 2 doctype parsing
6aac8d1a
tbs17 use the correct input file to adjust
763da8a4
tbs17 remove original files
6e09661c
tbs17 add testing files for doctype parsing
08440641
tbs17 add test function
9b78ff37
tbs17 requested a review from plutasnyy 1 year ago
tbs17 requested a review from christinestraub 1 year ago
plutasnyy
tbs17
tbs17 merge main
fb6e168b
tbs17 Merge remote-tracking branch 'origin/main' into tshen/fix-doctype-par…
9a234e0c
tbs17 force pushed from 0b591c41 to 9a234e0c 1 year ago
tbs17 delete unneeded files
363db621
tbs17 use *.DS_Store
30f285fe
ajjimeno approved these changes on 2024-12-06
badGarnet commented on 2024-12-06
tbs17 merged 8c58bc57 into main 1 year ago
tbs17 deleted the tshen/fix-doctype-parsing-error branch 1 year ago

Assignees
No one assigned