Skip to content

fix: accept multi-space xref subsection entries#799

Closed
vitormattos wants to merge 1 commit into
smalot:masterfrom
vitormattos:fix/xref-subsection-multi-space
Closed

fix: accept multi-space xref subsection entries#799
vitormattos wants to merge 1 commit into
smalot:masterfrom
vitormattos:fix/xref-subsection-multi-space

Conversation

@vitormattos
Copy link
Copy Markdown

@vitormattos vitormattos commented Apr 24, 2026

Summary

  • Fixes parsing of xref tables where subsection/entry fields use more than one space separator.
  • Prevents losing xref object map and avoids downstream Object list not found on valid but non-canonical files.
  • Adds integration fixture and regression test.

Reproduction

  • Fixture parsed by pdfinfo but parser produced Object list not found. Possible secured file..
  • Root cause: strict regex expected single-space separators in xref lines.

Changes

Signed-off-by: Vitor Mattos <1079143+vitormattos@users.noreply.github.com>
@vitormattos
Copy link
Copy Markdown
Author

This standalone PR has been restacked into the RawDataParser consolidation chain to keep a single merge path per source-file group.\n\nSuperseded-by chain:\n- upstream base: #796\n- fork consolidation into base: https://github.com/vitormattos/pdfparser/pull/28\n- stack continuation: https://github.com/vitormattos/pdfparser/pull/29\n\nClosing this standalone PR to avoid duplicate merge routes.

@vitormattos vitormattos deleted the fix/xref-subsection-multi-space branch April 27, 2026 17:10
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant