Skip to content

Latest commit

 

History

History
81 lines (63 loc) · 2.62 KB

File metadata and controls

81 lines (63 loc) · 2.62 KB

Changelog

All notable changes to this project will be documented in this file.

The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.

[Unreleased]

Added

  • Debug mode with visual overlay generation
  • Professional documentation and examples
  • MIT License and contributing guidelines
  • Modern Python packaging with pyproject.toml
  • Comprehensive error handling for PDF parsing
  • Improved table detection algorithms
  • Enhanced figure detection with better filtering

Changed

  • Improved accuracy for table detection
  • More robust error handling for malformed PDFs
  • Better figure filtering to reduce false positives
  • Enhanced debug visualization with color-coded overlays

Fixed

  • Fixed "list index out of range" errors in table detection
  • Resolved MuPDF format errors with proper exception handling
  • Improved grid extraction with better validation
  • Fixed figure detection parameters for better accuracy

[1.0.0] - 2024-01-XX

Added

  • Initial release of PDF Layout Analysis Engine
  • Multi-column layout detection
  • Figure and table extraction
  • Text block analysis and grouping
  • Caption linking functionality
  • Command-line interface
  • Python API for integration
  • Support for both ruled and borderless tables
  • Vector graphics and image object detection
  • Reading order determination
  • Batch processing capabilities

Features

  • Column Detection: Automatic multi-column layout recognition using whitespace analysis
  • Figure Extraction: Vector clustering and image XObject detection
  • Table Recognition: Hough line transform for ruled tables, cell density analysis for borderless tables
  • Text Processing: Connected component analysis with hierarchical grouping
  • Debug Mode: Visual overlay generation for layout analysis
  • Error Handling: Robust processing of various PDF formats and edge cases

Technical Details

  • Built with PyMuPDF for PDF processing
  • Uses OpenCV for computer vision operations
  • Implements DBSCAN clustering for vector graphics
  • Applies Hough transform for line detection
  • Uses projection profile analysis for column detection
  • Supports Python 3.8+ with comprehensive type hints

Version History

v1.0.0 (Initial Release)

  • Core layout analysis functionality
  • Figure and table detection
  • Multi-column processing
  • Command-line interface
  • Python API

Contributing

See CONTRIBUTING.md for details on how to contribute to this project.

License

This project is licensed under the MIT License - see the LICENSE file for details.