unstructured
feat: detect language for PDFs
#4051
Merged

shreyanid merged 14 commits into main from pdf_miner_lang
shreyanid changed the title from "feat: detect language with pdfminer text" to "feat: detect language for PDFs" 170 days ago
treighton approved these changes on 2025-07-11
shreyanid requested a review from badGarnet 170 days ago
shreyanid marked this pull request as ready for review 168 days ago
shreyanid force pushed from c623a596 to 6257e65e 167 days ago
shreyanid force pushed from a036f8f0 to 10e5b3ea 167 days ago
shreyanid debugging
500054a4
shreyanid .
cffeb4e2
shreyanid detect lang; working
0a87cff9
shreyanid clean
9ece453b
shreyanid tidy
07989c4d
shreyanid changelog release version; add lang file
01b645ea
shreyanid version bump
b5655dfb
shreyanid update tests
fbd3849f
shreyanid add fr tests
67c3cd2a
shreyanid exclude languages metadata from ingest tests
3c794000
shreyanid dont exclude lang
edb09cd0
ryannikolaidis feat: detect language for PDFs <- Ingest test fixtures update (#4058)
2533d4ce
shreyanid remove detect lang param from api test
36ecb5ff
shreyanid force pushed from 9744e8b3 to 36ecb5ff 167 days ago
shreyanid bump version
520b90cc
shreyanid enabled auto-merge 167 days ago
shreyanid merged 344202fa into main 167 days ago
shreyanid deleted the pdf_miner_lang branch 167 days ago

