    Repositories list

    • sm-umt

      Public
      Repository for the paper "Effective Self-Mining of In-Context Examples for Unsupervised Machine Translation with LLMs."
      Python
      Updated Mar 21, 2026
    • palm

      Public
      Python
      Updated Mar 21, 2026
    • simba

      Public
      Jupyter Notebook
      Updated Feb 12, 2026
    • AfroScope

      Public
      Python
      Updated Feb 10, 2026
    • afrolid

      Public
      AfroLID, a neural toolkit for African language identification that covers 517 African languages.
      Python
      Apache License 2.0
      Updated Feb 5, 2026
    • InterPARES-Audio is an automated, end-to-end pipeline that transforms complex, lengthy audio files into structured, insightful text
      Jupyter Notebook
      Updated Dec 8, 2025
    • An AI-powered conversational assistant designed to help users explore and understand InterPARES (International Research on Permanent Authentic Records in Electr…
      Python
      Updated Dec 4, 2025
    • InterPARES-Vision is an advanced OCR (Optical Character Recognition) and layout analysis tool designed specifically for archival documents. It combines state-of…
      Python
      Updated Dec 4, 2025
    • nilechat

      Public
      Updated Nov 11, 2025
    • This repository contains the evaluation code and data for the PalmX 2025 Shared Task on Benchmarking LLMs for Arabic and Islamic Culture.
      Python
      Updated Sep 3, 2025
    • Toucan

      Public
      Updated Sep 2, 2025
    • sahara

      Public
      Benchmarking African NLP
      Python
      Updated Aug 18, 2025
    • pearl

      Public
      An official repository for the paper “Pearl: A Multimodal, Culturally-Aware Arabic Instruction Dataset.”
      Python
      Updated May 29, 2025
    • Updated Mar 4, 2025
    • uDistil-Whisper: Label-Free Data Filtering for Knowledge Distillation in Low-Data Regimes (NAACL 2025)
      Python
      MIT License
      Updated Feb 11, 2025
    • SPARROW

      Public
      EMNLP 2023
      Updated Feb 7, 2025
    • peacock

      Public
      This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.
      Updated Dec 9, 2024
    • Jupyter Notebook
      Updated Oct 10, 2024
    • VioletV2

      Public
      Python
      MIT License
      Updated Sep 11, 2024
    • Cheetah

      Public
      Updated Aug 12, 2024
    • MoS

      Public
      Python
      GNU General Public License v3.0
      Updated Aug 7, 2024
    • llmas

      Public
      Python
      Other
      Updated Aug 7, 2024
    • copticmt

      Public
      Python
      Updated Jul 7, 2024
    • A repo for the "Fumbling in Babel" paper at NAACL 2024: https://aclanthology.org/2024.findings-naacl.274/
      Updated Jul 5, 2024
    • AraNet

      Public
      Python
      GNU General Public License v3.0
      Updated Jun 15, 2024
    • HTML
      MIT License
      Updated Jun 11, 2024
    • fintral

      Public
      Updated Jun 5, 2024
    • araT5

      Public
      AraT5: Text-to-Text Transformers for Arabic Language Understanding
      Updated May 16, 2024
    • octopus

      Public
      Octopus is a neural machine generation toolkit for Arabic Natural Language Generation (NLG)
      Python
      Updated Apr 29, 2024
    • nadi

      Public
      Nuanced Arabic Dialect Identification Shared Tasks (NADI) 2020 and 2021
      Python
      Updated Mar 4, 2024