    Repositories list

    • Public end-to-end joint ASR and diarization code for child-adult interactions
      Python
      1700 Updated Feb 6, 2026
    • Deep learning for eye tracking trials
      Python
      MIT License
      3200 Updated Feb 3, 2026
    • Learning descriptions of literary and fictional characters
      Python
      MIT License
      0500 Updated Oct 14, 2025
    • Python
      0000 Updated Aug 22, 2025
    • Estimating External Stressors in Driving through Multimodal Physiological Monitoring
      Python
      MIT License
      0100 Updated Jun 30, 2025
    • [KDD 2023] FedMultimodal: A Benchmark For Multimodal Federated Learning
      Python
      Apache License 2.0
      2414320 Updated May 24, 2025
    • Public child-adult speaker diarization/classification model and code
      Python
      31810 Updated Apr 24, 2025
    • M3BERT

      Public
      A music transformer that extracts representations of audio using several hundreds of thousands of music clips. Fine-tuning is done with diverse end-tasks to enr…
      Python
      MIT License
      0400 Updated Nov 17, 2024
    • ccmi-sear

      Public
      Public repository for SEAR audio model
      Python
      MIT License
      0000 Updated Oct 17, 2024
    • SAIM-ADS

      Public
      Repository for experiments and preprocessing related to the analysis of advertisement videos
      Jupyter Notebook
      0100 Updated Sep 28, 2024
    • Can Synthetic Audio From Generative Foundation Models Assist Audio Recognition and Speech Modeling?
      Python
      Apache License 2.0
      1800 Updated Aug 29, 2024
    • peft-ser

      Public
      [ACII 2023] PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Speech Models
      Python
      Apache License 2.0
      95900 Updated Jul 1, 2024
    • Movie Screenplay Parser
      Python
      GNU General Public License v3.0
      31310 Updated Apr 29, 2024
    • Contains code to scrape the scriptsonscreen scripts website, along with the scraped data
      Python
      GNU General Public License v3.0
      0100 Updated Feb 28, 2024
    • Coreference in Movie Scripts
      Python
      GNU General Public License v3.0
      0000 Updated Feb 28, 2024
    • Character tropes, Forensic Interviews, and Character Attributes
      Jupyter Notebook
      MIT License
      0200 Updated Jan 15, 2024
    • Data and code for analysis of the India TV Show Study
      PostScript
      MIT License
      0000 Updated Nov 12, 2023
    • Repo for SCMIA and GSCMIA
      0100 Updated Oct 7, 2023
    • This repository contains the codebase for MovieCLIP: Visual Scene Recognition in Movies
      Python
      MIT License
      44240 Updated Oct 1, 2023
    • Repository for context-based emotion recognition
      Python
      MIT License
      0000 Updated Sep 25, 2023
    • trust-ser

      Public
      Trustworthy Speech Emotion Recognition
      Python
      Apache License 2.0
      31300 Updated May 22, 2023
    • Codebase for analyzing RP metrics for MUSE LoREAL study
      Jupyter Notebook
      0000 Updated May 11, 2023
    • SAIL-CCMI

      Public
      Outline of the webpage for CCMI subgroup
      SCSS
      1.3k000 Updated May 8, 2023
    • Egocentric Foreground Speech Detection
      Python
      2500 Updated Apr 27, 2023
    • CLAP

      Public
      Contrastive Language-Audio Pretraining
      Python
      Creative Commons Zero v1.0 Universal
      213000 Updated Apr 21, 2023
    • llama

      Public
      Inference code for LLaMA models
      Python
      GNU General Public License v3.0
      9.8k000 Updated Apr 8, 2023
    • Segmentation Algorithms for Physiological Time Series
      Python
      MIT License
      3500 Updated Mar 21, 2023
    • Python
      0200 Updated Feb 10, 2023
    • A dataset for Audio-Visual Sound Event Detection in Movies
      Python
      12630 Updated Jan 23, 2023
    • whisper

      Public
      Robust Speech Recognition via Large-Scale Weak Supervision
      Jupyter Notebook
      MIT License
      12k000 Updated Dec 20, 2022