Skip to content
Change the repository type filter

All

    Repositories list

    • Python
      Other
      0000Updated Apr 24, 2026Apr 24, 2026
    • CLDF dataset accompanying Pache's "Lexical Parallels in South American Languages" from 2023
      TeX
      Creative Commons Attribution 4.0 International
      0000Updated Apr 22, 2026Apr 22, 2026
    • ideobank

      Public
      TeX
      Creative Commons Attribution 4.0 International
      0010Updated Apr 16, 2026Apr 16, 2026
    • CLDF dataset derived from Kitchen et al.'s "Bayesian phylogenetic analysis of Semitic languages" from 2009
      TeX
      Other
      1000Updated Mar 24, 2026Mar 24, 2026
    • CLDF dataset derived from Ugarte et al.'s "NorthPeruLex - A Lexical Dataset of Small Language Families and Isolates from Northern Peru (forthcoming)
      Python
      Creative Commons Attribution 4.0 International
      0230Updated Mar 24, 2026Mar 24, 2026
    • TeX
      Creative Commons Attribution 4.0 International
      1000Updated Mar 24, 2026Mar 24, 2026
    • CLDF dataset for data on Andean languages
      TeX
      Creative Commons Attribution 4.0 International
      1000Updated Mar 23, 2026Mar 23, 2026
    • tlopo

      Public
      The lexicon of Proto Oceanic
      TeX
      Creative Commons Attribution 4.0 International
      0030Updated Mar 3, 2026Mar 3, 2026
    • CLDF dataset derived from Khalid's "Grammatical Sketch of Asur" from 2020
      Python
      Creative Commons Attribution 4.0 International
      0000Updated Feb 25, 2026Feb 25, 2026
    • CLDF dataset derived from Chacon's "Revised Proposal of Proto-Tukanoan consonants" from 2014
      Python
      Creative Commons Attribution 4.0 International
      0100Updated Feb 17, 2026Feb 17, 2026
    • The python curation library for lexibank
      Python
      Apache License 2.0
      72140Updated Feb 12, 2026Feb 12, 2026
    • uralex

      Public
      UraLex basic vocabulary dataset
      TeX
      Creative Commons Attribution 4.0 International
      6450Updated Feb 10, 2026Feb 10, 2026
    • Walker and Ribeiro (2011) Arawakan dataset
      Python
      Creative Commons Attribution 4.0 International
      0000Updated Jan 6, 2026Jan 6, 2026
    • CLDF dataset derived from Lundgren's "Phonological Reconstruction of Proto-Omagua–Kokama–Tupinambá" from 2020
      Python
      Creative Commons Attribution 4.0 International
      0010Updated Dec 22, 2025Dec 22, 2025
    • CLDF dataset derived from Dunn and Tresoldi's "IELex Data and Tree" from 2021
      Python
      Creative Commons Attribution 4.0 International
      1021Updated Dec 21, 2025Dec 21, 2025
    • Sound-Comparisons Vanuatu
      Python
      Other
      1330Updated Oct 29, 2025Oct 29, 2025
    • abvd

      Public
      CLDF dataset derived from Greenhill et al.'s "Austronesian Basic Vocabulary Database" from 2020.
      TeX
      Creative Commons Attribution 4.0 International
      3400Updated Oct 7, 2025Oct 7, 2025
    • Python
      Apache License 2.0
      0100Updated Aug 20, 2025Aug 20, 2025
    • pytlopo

      Public
      Python
      Apache License 2.0
      0000Updated Aug 15, 2025Aug 15, 2025
    • asjp

      Public
      CLDF dataset derived from Wichmann et al.'s "ASJP Database"
      TeX
      Creative Commons Attribution 4.0 International
      2150Updated Aug 4, 2025Aug 4, 2025
    • CLDF dataset derived from the Rutul Basic Lexicon
      Python
      Creative Commons Attribution 4.0 International
      1100Updated Jun 23, 2025Jun 23, 2025
    • CLDF dataset derived from Greenhill's "TransNewGuinea.org" from 2015
      TeX
      Creative Commons Attribution 4.0 International
      2020Updated Jun 23, 2025Jun 23, 2025
    • leecaijia

      Public
      CLDF dataset derived from Lee's "Phonological features of Caijia" from 2022
      Python
      Creative Commons Attribution 4.0 International
      0000Updated Jun 18, 2025Jun 18, 2025
    • CLDF dataset derived from Chacon's "Annotated Swadesh Lists for Tukanoan Languages" from 2017
      Python
      Creative Commons Attribution 4.0 International
      1000Updated Jun 11, 2025Jun 11, 2025
    • cals

      Public
      CLDF dataset derived from Mennecier et al.'s "Central Asian Language Survey" from 2016
      Python
      Creative Commons Attribution 4.0 International
      0010Updated Jun 11, 2025Jun 11, 2025
    • dravlex

      Public
      CLDF dataset derived from Kolipakam et al.'s "DravLex:" from 2018.
      Python
      Creative Commons Attribution 4.0 International
      1000Updated Jun 11, 2025Jun 11, 2025
    • CLDF dataset derived from Lee's "Sketch of Language History in the Korean Peninsula" from 2015
      Python
      Creative Commons Attribution 4.0 International
      1000Updated Jun 11, 2025Jun 11, 2025
    • CLDF dataset derived from McElhanon's "Preliminary Observations on Huon Peninsula Languages" from 1967
      Python
      Creative Commons Attribution 4.0 International
      0000Updated Jun 11, 2025Jun 11, 2025
    • ala

      Public
      Automated Language Affiliation Pipeline for Lexibank
      Python
      MIT License
      0000Updated Jun 5, 2025Jun 5, 2025
    • Study on lexibank data (presenting the lexibank dataset).
      TeX
      Creative Commons Attribution 4.0 International
      41510Updated Apr 11, 2025Apr 11, 2025
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.