Describe the bug
datasets 4.0 moved to using torchcodec for audio datasets, which doesn't support ARM hardware under Linux and Windows
Steps to reproduce the bug
try:
- Try to install
torchcodec under Linux ARM64
or:
- Try to install
torchcodec under Windows ARM64
or, on either of those platforms, using datasets >= 4.0:
dataset = load_dataset("garak-llm/audio_achilles_heel")
Expected behavior
dataset will load as expected
Environment info
ARM environment
Python 3.13+
datasets>=4.0
Using torchaudio
Describe the bug
datasets4.0 moved to usingtorchcodecfor audio datasets, which doesn't support ARM hardware under Linux and WindowsSteps to reproduce the bug
try:
torchcodecunder Linux ARM64or:
torchcodecunder Windows ARM64or, on either of those platforms, using datasets >= 4.0:
dataset = load_dataset("garak-llm/audio_achilles_heel")Expected behavior
dataset will load as expected
Environment info
ARM environment
Python 3.13+
datasets>=4.0Using
torchaudio