    Repositories list

    • PageRLib

      Public
      Python
      0000 Updated Apr 27, 2026
    • GUTS

      Public
      A dataset for table structure recognition, built from two existing datasets: PubTables-1M and REalheatBench.
      MIT License
      0100 Updated Apr 27, 2026
    • Jupyter Notebook
      0000 Updated Apr 24, 2026
    • rag_test

      Public
      Python
      Apache License 2.0
      0000 Updated Apr 21, 2026
    • PageR

      Public
      Python
      Apache License 2.0
      0700 Updated Apr 20, 2026
    • Python
      0200 Updated Apr 13, 2026
    • Python
      Apache License 2.0
      0020 Updated Apr 8, 2026
    • Python
      0000 Updated Apr 7, 2026
    • Jupyter Notebook
      0000 Updated Apr 7, 2026
    • Python
      0000 Updated Apr 6, 2026
    • Jupyter Notebook
      Apache License 2.0
      0000 Updated Apr 2, 2026
    • Precision PDF is a high-precision Java-based PDF parsing and extraction framework built on top of Apache PDFBox. It provides structured content extraction with …
      Java
      1000 Updated Apr 1, 2026
    • Russian-Facts-200 (RF-200): A novel benchmark for fact extraction from Russian tabular data.
      Jupyter Notebook
      MIT License
      1200 Updated Feb 5, 2026
    • Python
      0000 Updated Feb 3, 2026
    • FontEmb

      Public
      Jupyter Notebook
      0000 Updated Feb 2, 2026
    • Java
      0010 Updated Dec 4, 2025
    • Jupyter Notebook
      0000 Updated Oct 23, 2025
    • A web-based application (client) for Semantic Table Linker.
      PHP
      MIT License
      0000 Updated Oct 15, 2025
    • A Java tool for modifying character mappings (CMap) in PDF documents using the Apache PDFBox library. This utility allows batch processing of PDF files with custom …
      Java
      Apache License 2.0
      0000 Updated Oct 14, 2025
    • Semantic Table Linker.
      Python
      MIT License
      0000 Updated Oct 12, 2025
    • Script for sending and processing JSON messages to the Talisman framework.
      Python
      MIT License
      0000 Updated Oct 1, 2025
    • Module for scraping photos from map web pages.
      Python
      0000 Updated Aug 5, 2025
    • Jupyter Notebook
      Apache License 2.0
      0000 Updated Jul 23, 2025
    • wordGLAM

      Public
      Jupyter Notebook
      0000 Updated Jul 21, 2025
    • OmniGraph

      Public
      Apache License 2.0
      0100 Updated Jul 1, 2025
    • T5-GlyF

      Public
      Jupyter Notebook
      0000 Updated Jun 18, 2025
    • JavaScript
      2200 Updated Jun 9, 2025
    • IIC-doc

      Public
      Jupyter Notebook
      0000 Updated Jun 8, 2025
    • Python
      0200 Updated May 30, 2025
    • Python
      0100 Updated May 29, 2025