Skip to content

What.cd metadata preservation #11

@denisnazarov

Description

@denisnazarov

As you may have heard, what.cd, a private torrent tracker that is often called "the greatest music collection in the history of the world," was permanently shut down last week.

By many accounts, it was one of the most thorough music catalogs in history, and its shutting down represents a huge loss of cultural heritage data.

E.g.:

“Collages” were one of What’s best features. Users arranged lists of albums on the site into useful categories like “Intro to free jazz” or “Bands with a male and female singer.” These were indispensable sources of musical discovery.

The goal of this issue is to coordinate an effort to preserve the metadata from what.cd in Mediachain (albums, artists, tracks, collages) for the purposes of cultural preservation (not the torrents or media itself).

Why Mediachain is the appropriate solution:

  • everything is immutable and crypto signed so provenance/authorship of the data is permanently established
  • content-addressed links are location-independent and permanent: an organization such as Internet Archive can act as the primary host of the assets, but other orgs or users can mirror whole or part automatically
  • mediachain provides decentralized resolution from "canonical" IDs like ISRC/ISWC/etc to the content-addressed hashes so we can ensure the links keep working no matter where the data lives
  • dataset remains richly structured and accessible, instead of being a multi-terabyte dump on a server somewhere that only specialized researchers ever use (as often happens with these kinds of exports)

Please comment below if you have access to a recent What.cd metadata dump or would like to help in some other way.

Related to #10

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions