
[WIP] Mbrainz Dataset

Christian Romney edited this page Jan 18, 2015 · 2 revisions

If you're using this Docker image to follow the Datomic Tutorials or video training, you'll want to load the Mbrainz Dataset. To help make this easy, we've bundled a shell script: /data/restore-mbrainz.sh. Before you run this script, however, there are a few modifications you'll need to make to the container running Datomic Pro Starter.

If you're running boot2docker on Mac OS X via VirtualBox (as I am), your boot2docker VM likely has only 2GB of RAM. Stop the VM with boot2docker down, then increase its RAM. I use 4096MB (4GB) on my MacBook Pro with 16GB total.
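As a rough sketch, the resize can be scripted along these lines. The VM name boot2docker-vm is an assumption (it is the usual default, but check yours with VBoxManage list vms), and the run helper and DRY_RUN switch are illustrative additions that print each command instead of executing it until you flip the switch.

```shell
#!/bin/sh
# Assumed VM name: verify with `VBoxManage list vms` before running.
VM_NAME="boot2docker-vm"
MEMORY_MB=4096   # 4GB, as suggested above
DRY_RUN=1        # set to 0 to actually execute the commands

# Hypothetical helper: print a command, and run it only when DRY_RUN=0.
run() {
  echo "+ $*"
  if [ "$DRY_RUN" = "0" ]; then "$@"; fi
}

run boot2docker down                                      # the VM must be stopped first
run VBoxManage modifyvm "$VM_NAME" --memory "$MEMORY_MB"  # memory is given in MB
run boot2docker up                                        # restart with the new setting
```

Review the printed commands first, then set DRY_RUN=0 once the VM name matches your setup.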

Next, you'll want to pass two environment variables to the Docker container:

--env XMS="-Xms2g" --env XMX="-Xmx2g"
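For context, here is a sketch of how those flags might fit into a complete docker run invocation. The image name example/datomic-pro-starter and the container name datomicdb are placeholders, not real published names; the port mappings mirror the ones used elsewhere on this page. The script only prints the command unless DRY_RUN is set to 0.

```shell
#!/bin/sh
IMAGE="example/datomic-pro-starter"  # hypothetical image name, substitute your own
NAME="datomicdb"                     # hypothetical container name
HEAP="2g"                            # JVM heap size passed via XMS and XMX
DRY_RUN=1                            # set to 0 to actually start the container

# Build the command as positional parameters so each argument stays intact.
set -- docker run -d --name "$NAME" \
  -p 4334:4334 -p 4335:4335 -p 4336:4336 \
  --env "XMS=-Xms${HEAP}" --env "XMX=-Xmx${HEAP}" \
  "$IMAGE"

RUN_CMD="$*"
echo "$RUN_CMD"
if [ "$DRY_RUN" = "0" ]; then "$@"; fi
```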

You can also set them in fig.yml if you are using Fig:

datomicdb:
  build: db/datomic  
  ports:
    - "4336:4336"
    - "4335:4335"
    - "4334:4334"
  environment:
    XMS: "-Xms2g"
    XMX: "-Xmx2g"

These settings fix the JVM's initial and maximum heap at 2GB, which gives Datomic enough memory to load the dataset. Now you can load the data into your container by executing:

$ docker exec <container> /data/restore-mbrainz.sh <storage>

where <container> is the name or id of the running Datomic Pro Starter container and <storage> is the name of the Datomic storage you are using (e.g. dev).
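To make the placeholders concrete, here is a sketch with both values filled in. The container name datomicdb is hypothetical (run docker ps to find the real name or id), and dev is the example storage mentioned above. As written, the script only prints the command; set DRY_RUN to 0 to run the restore.

```shell
#!/bin/sh
CONTAINER="datomicdb"  # hypothetical; look up the real name or id with `docker ps`
STORAGE="dev"          # the Datomic storage you are using
DRY_RUN=1              # set to 0 to actually run the restore

# Build the command as positional parameters so each argument stays intact.
set -- docker exec "$CONTAINER" /data/restore-mbrainz.sh "$STORAGE"

RESTORE_CMD="$*"
echo "$RESTORE_CMD"
if [ "$DRY_RUN" = "0" ]; then "$@"; fi
```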
