unstructured
fix: pdfminer drops extractable text
#4310
Merged

fix: pdfminer drops extractable text #4310

qued merged 18 commits into main from fix/pdfminer-drops-extractable-text
qued
qued Add test for embedded cmap text
cf230145
qued monkeypatch pdfminer
96995e89
qued update version and changelog
42ce371e
qued Merge branch 'main' into fix/pdfminer-drops-extractable-text
e4b83b21
cursor
cursor commented on 2026-03-31
qued formatting
0d4b72f7
qued version
0ca0f56c
qued fix: cap CMap range expansion to prevent DoS from crafted PDFs
18a88ff9
cursor
cursor commented on 2026-03-31
qued fix: use total_mappings counter for CMap DoS cap
23fdb00c
cursor
cursor commented on 2026-03-31
qued fix: harden CMap parser against reversed ranges and indirect refs
446c6f5d
qued Add test fixture
c8a1e34d
qued subclass instead of monkeypatch
a6debcaa
qued linty
278f5252
badGarnet
badGarnet commented on 2026-04-01
badGarnet
badGarnet approved these changes on 2026-04-01
qued add hi_res test
ef8896de
qued Add more test cases
a326c2b2
qued fix: add stream size cap and WMode support to CMap parser
1f078b48
qued format
1e109e1e
qued Add more security tests for CustomPDFResourceManager
229c400c
qued refactor: resolve embedded CMap at font construction time
4326b15f
cragwolfe
cragwolfe approved these changes on 2026-04-01
qued qued merged 6ada488f into main 82 days ago
qued qued deleted the fix/pdfminer-drops-extractable-text branch 82 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone