unstructured
Kravetsmic/1004 add the ability to ignore <header> and <footer> tags in partition html
#1013
Merged

Kravetsmic/1004 add the ability to ignore <header> and <footer> tags in partition html #1013

kravetsmic
kravetsmic don't push
ce26a827
Coniferish enhancement: improve json detection by detect_filetype (#971)
f911fdd5
MthwRobinson refactor: simplifies JSON detection and add tests (#975)
63076ba9
potter-potter feat: adds Outlook connector (#939)
99841427
rbiseck3 Roman/expose dpi param (#966)
c9f228a7
MthwRobinson chore: cleanup changelog for 0.8.2 (#976)
70c0715b
shreyanid Update `partition_via_api` to not post a strategy value if not user s…
be675a5b
cragwolfe build(release): cut 0.8.4 release (#979)
c587720c
MthwRobinson feat: add document date for remaining file types (#930) (#969)
708795a8
yuming-long Chore: add uns api repo unittests (#954)
424a8932
MthwRobinson fix: handling for empty tables in word docs and powerpoints (#982)
514ad533
MthwRobinson fix: only download nltk packages if necessary (#985)
3f0d1fe5
yuming-long Chore: Pass table support param to partition image (#973)
ce17cb89
shreyanid Update pip in makefile (#981)
bb744b13
ryannikolaidis chore: remove debug printing (#988)
29154387
MthwRobinson fix: correct nltk download arg order (#991)
eaa91bca
yuming-long Chore: put back function `split_by_paragraph` (#992)
e5ff2f38
kravetsmic don't push
825ad8a5
kravetsmic fix: clean up code
f73bab58
kravetsmic fix: clean up
56621374
kravetsmic fix: clean up
969c92d6
MthwRobinson feat: add document date for remaining file types (#930) (#969)
8965e48d
rbiseck3 Roman/ingest refactor (#978)
e7583365
potter-potter feat: adds Box connector (#996)
b7dc8417
cragwolfe chore: rename Element's "date" field to "last_modified" (#997)
ec172302
kravetsmic don't push
1fb22b12
MthwRobinson feat: add document date for remaining file types (#930) (#969)
81f5ec79
MthwRobinson feat: add document date for remaining file types (#930) (#969)
df205a09
kravetsmic fix: removie prints
931f4a1c
kravetsmic remove unused file
ad8a80ca
kravetsmic Merge remote-tracking branch 'upstream/main' into main
c6fae4d7
kravetsmic feat: add skip_footers_and_headers parameter
8c4a914a
kravetsmic feat: crate file for testing
c4e6c05a
kravetsmic feat: add testing for skip_footers_and_headers prm
f4ac1427
kravetsmic feat: update CHANGELOG
7048f43f
kravetsmic fix: remove unused import
a4057f57
MthwRobinson
kravetsmic
MthwRobinson Merge branch 'main' into kravetsmic/1004-Add-the-ability-to-ignore-<h…
23521c4a
MthwRobinson changelog and version
6c501806
MthwRobinson update docs
306ec4d0
MthwRobinson linting, linting, linting
ccb50e61
MthwRobinson
MthwRobinson approved these changes on 2023-08-04
MthwRobinson MthwRobinson enabled auto-merge (squash) 2 years ago
MthwRobinson MthwRobinson merged 25ca5744 into main 2 years ago
huangpan2507

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone