Skip to content

Pinafore/audio_data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1,204 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Audio QA Dataset

This repository contains a collection of audio clips along with annotated question-answer pairs stored in a structured metadata file.

AUDITA

This dataset accompanies the paper:

AUDITA: A New Dataset to Audit Humans vs. AI Skill at Audio QA Tasnim Kabir, Dmytro Kurdydyk, Aadi Palnitkar, Liam Dorn, Ahmed Haj Ahmed, and Jordan Lee Boyd-Graber (2026)

Paper: https://arxiv.org/abs/2604.21766

If you use this dataset, please cite:

@article{kabir2026audita,
  title={AUDITA: A New Dataset to Audit Humans vs. AI Skill at Audio QA},
  author={Kabir, Tasnim and Kurdydyk, Dmytro and Palnitkar, Aadi and Dorn, Liam and Ahmed, Ahmed Haj and Boyd-Graber, Jordan Lee},
  journal={arXiv preprint arXiv:2604.21766},
  year={2026}
}

📁 Folder Structure

. ├── audio/ # Contains actual audio files referenced in the metadata └── combined.json # Metadata with QA pairs and file references

📄 File Descriptions

combined.json

A list of JSON objects, each representing a QA annotation for an audio file.

Human-Ready Unsplit Files Overview

This dataset consists of unsplit, human-ready question-answer files, categorized as follows:

OUR Sources

Quizbowl-style

  • Pavements: 673 questions
  • Audio-Packets: 1,649 questions

Trivia-style

  • Quizmasters: 4,138 questions

Subtotal (OUR): 6,460 questions

EXTERNAL Sources

Close-Ended Questions

  • OpenAQA: 882 questions
  • ClothoAQA: 323 questions

Open-Ended Questions

  • OpenAQA: 2,025 questions

Subtotal (EXTERNAL): 3,230 questions


Total Human-Ready Questions: 9,690

Each entry contains:

Field Description
question The question posed about the audio content
dataset The source dataset (e.g., clotho_aqa)
file_name Path to the corresponding audio file
task Type of task (e.g., closed_ended, open_ended)
ground_truth The correct answer to the question
Categories High-level category of the sound (e.g., Character/Person)
Subcategories More specific label (or N/A if not applicable)

Example Entry

{
  "question": "Are humans heard?",
  "dataset": "clotho_aqa",
  "file_name": "/data/clotho_aqa/Backyard nature.wav",
  "task": "closed_ended",
  "ground_truth": "yes",
  "Categories": "Character/Person",
  "Subcategories": "N/A"
}

Note: Only the actual audio files (e.g., Backyard nature.wav or 123456.flac) are placed in the audio/ folder. You do not need to search the full path listed in file_name. Just extract the final filename from the path and look for it in the audio/ directory.

audio/ folder

This folder contains only the final audio files referenced in combined.json, in .wav or .flac format.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors