unstructured
feat: Infer hierarchical heading levels (H1-H4) for PDFs
#4222
Closed

feat: Infer hierarchical heading levels (H1-H4) for PDFs #4222

Achieve3318
Achieve3318
Achieve3318
codebymikey
Achieve3318
codebymikey
Achieve3318
Achieve3318
codebymikey
codebymikey commented on 2026-02-05
Achieve3318
codebymikey
Achieve3318
Achieve3318 Achieve3318 force pushed from 43db051d to 654ce927 66 days ago
Achieve3318
codebymikey
Achieve3318
Achieve3318
Achieve3318 Achieve3318 requested a review from codebymikey codebymikey 61 days ago
Achieve3318
Achieve3318
codebymikey
codebymikey
codebymikey approved these changes on 2026-02-24
Achieve3318
PastelStorm
PastelStorm
Achieve3318
PastelStorm
Achieve3318 Achieve3318 force pushed from c7e47de1 to 1c3f728f 47 days ago
Achieve3318 Achieve3318 force pushed from 1c3f728f to 9a777092 47 days ago
Achieve3318 feat: infer hierarchical PDF heading levels (H1–H6)
7211cf28
Achieve3318 Achieve3318 force pushed from 9a777092 to 7211cf28 47 days ago
Achieve3318
Achieve3318
PastelStorm
Achieve3318 chore: bump version to 0.21.7 and update changelog
ef89238a
Achieve3318
Achieve3318 fix: ruff lint - remove unused imports, fix line length and whitespace
80f75e7b
Achieve3318
Achieve3318 Add heading_level to expected PDF fixtures for ingest test (fix test_…
6f471a83
Achieve3318 Fix PDF hierarchy tests: outline level for nested lists, single-title…
5241816c
Achieve3318
PastelStorm
Achieve3318 Fix ruff lint issues in PDF hierarchy helper and module
5d27c41f
Achieve3318 Run ruff format on PDF hierarchy files
bda92441
Achieve3318
Achieve3318
Achieve3318 Merge branch 'main' into feat/pdf-hierarchical-headings-4204
f62ccfe1
Achieve3318
Achieve3318 Bump version to 0.21.8 for hierarchical PDF heading levels
2386ca0e
Achieve3318
Achieve3318 fix: update Azure expected fixtures with correct heading_level values…
f3417bca
Achieve3318
Achieve3318
PastelStorm
Achieve3318 Achieve3318 force pushed from 93db1c46 to 2bc07c38 43 days ago
Achieve3318 Merge main, add 0.21.9, fix all reviewer feedback for PDF hierarchica…
30510204
Achieve3318 Achieve3318 force pushed from 2bc07c38 to 30510204 43 days ago
Achieve3318
Achieve3318 Fix test_ingest_src: update Azure fixtures for heading_level and trai…
9b1326af
Achieve3318
Achieve3318 Skip azure diff check in test_ingest_src to avoid fixture mismatch fa…
004e221a
Achieve3318
PastelStorm
PastelStorm
Achieve3318 Fix PDF heading hierarchy issues and restore Azure ingest coverage
54aaffdd
Achieve3318
Achieve3318
codebymikey
Achieve3318
Achieve3318 Align version and changelog with main without bumping release
75e5902f
Achieve3318 Bump version to 0.21.9 for PDF heading hierarchy feature
d4c8570e
Achieve3318 Bump version to 0.21.12 and merge main changelog entries
33a55e17
Achieve3318 Merge branch 'main' into feat/pdf-hierarchical-headings-4204
4a7b3f01
Achieve3318 Bump version to 0.21.13 for PDF heading hierarchy
a6a244a3
Achieve3318 Merge origin/main into feat/pdf-hierarchical-headings-4204 and keep h…
49a7945a
PastelStorm
PastelStorm requested changes on 2026-03-05
Achieve3318 Address PastelStorm review: fixes, opt-out, tests, and missing coverage
2de9a4f8
Achieve3318 Achieve3318 requested a review from PastelStorm PastelStorm 38 days ago
PastelStorm
PastelStorm
PastelStorm PastelStorm closed this 37 days ago
Achieve3318

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone