Skip to content
Change the repository type filter

All

    Repositories list

    • MedExpert

      Public
      Code for the "MedExpert: An Expert-Annotated Dataset for Medical Chatbot Evaluation" paper at Machine Learning for Health (ML4H) 2025.
      Python
      MIT License
      0700Updated Apr 18, 2026Apr 18, 2026
    • ManyIH

      Public
      Python
      MIT License
      0900Updated Apr 13, 2026Apr 13, 2026
    • Python
      0200Updated Apr 3, 2026Apr 3, 2026
    • Helps us monitor the GPUs for the IA-lab
      TypeScript
      1000Updated Mar 19, 2026Mar 19, 2026
    • SciTaRC

      Public
      Python
      0100Updated Mar 11, 2026Mar 11, 2026
    • NSF CCRI ENS Project: Next Generation Tools for Spoken Language Science & Technology
      1000Updated Feb 24, 2026Feb 24, 2026
    • Python
      4400Updated Feb 9, 2026Feb 9, 2026
    • mmBERT

      Public
      A massively multilingual modern encoder language model
      Python
      1014020Updated Jan 20, 2026Jan 20, 2026
    • al-qasida

      Public
      Python
      0100Updated Jan 5, 2026Jan 5, 2026
    • Python
      2500Updated Dec 15, 2025Dec 15, 2025
    • HTML
      0000Updated Dec 10, 2025Dec 10, 2025
    • Essential code for the paper *Genomic Next-Token Predictors are In-Context Learners*.
      Python
      1400Updated Nov 16, 2025Nov 16, 2025
    • Python
      1200Updated Nov 5, 2025Nov 5, 2025
    • Python
      0200Updated Sep 23, 2025Sep 23, 2025
    • Code and data for the paper: "Hell or High Water: Evaluating Agentic Recovery from External Failures"
      Python
      MIT License
      0600Updated Aug 14, 2025Aug 14, 2025
    • Jupyter Notebook
      0200Updated Aug 11, 2025Aug 11, 2025
    • 0200Updated Aug 6, 2025Aug 6, 2025
    • State-of-the-art paired encoder and decoder models (17M-1B params)
      Python
      MIT License
      56901Updated Aug 6, 2025Aug 6, 2025
    • Code for paper FEEDBACK FRICTION: LLMs Struggle to Fully Incorporate External Feedback https://arxiv.org/pdf/2506.11930
      Python
      0800Updated Jun 16, 2025Jun 16, 2025
    • Python
      1100Updated Jun 12, 2025Jun 12, 2025
    • NeoCoder

      Public
      Official implementation of our paper "Benchmarking Language Model Creativity: A Case Study on Code Generation"
      Python
      Apache License 2.0
      41200Updated May 16, 2025May 16, 2025
    • Bringing BERT into modernity via both architecture changes and scaling
      Python
      Apache License 2.0
      145000Updated Apr 22, 2025Apr 22, 2025
    • Code and dataset for the paper: Can LLMs Generate Tabular Summaries of Science Papers? Rethinking the Evaluation Protocol (https://arxiv.org/pdf/2504.10284)
      Python
      MIT License
      0120Updated Apr 21, 2025Apr 21, 2025
    • Code for paper "Verifiable by Design: Aligning Language Models to Quote from Pre-Training Data"
      Python
      0300Updated Apr 21, 2025Apr 21, 2025
    • The repo for the paper "CLAIMCHECK: How Grounded are LLM Critiques of Scientific Papers? "
      1200Updated Apr 20, 2025Apr 20, 2025
    • Web Agent Arena
      HTML
      0000Updated Apr 10, 2025Apr 10, 2025
    • Python
      8600Updated Apr 7, 2025Apr 7, 2025
    • Python
      1010Updated Feb 15, 2025Feb 15, 2025
    • This project focus on curating a robust analogical reasoning dataset for research and development.
      Python
      2600Updated Dec 18, 2024Dec 18, 2024
    • Web-grounded natural language instructions
      HTML
      Apache License 2.0
      61830Updated Nov 25, 2024Nov 25, 2024
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.