AudioVJ AI

Real-time DJ phrase detection for lighting/visual control.

Goal

The goal of this project is to create a real-time DJ phrase detection system that can be used for lighting and visual control during live performances. No need for pre-processing of audio tracks or syncing, just supply an audio signal of the DJ's output.

🚧 Work In Progress

Model architecture is very rough still.
The performance isn't great yet.
For tempo and phase detection, Ableton Link/Carabiner is used in the meantime, until I can get the phrase detection working well enough. Then, I can focus on implementing another model to track tempo and phase.

Pipeline

1. Label tracks in rekordbox

Create a playlist named "audiovj" and add tracks you want to use for training. The name of the playlist can be configured when importing, in step 2, but by default it is "audiovj"
For each song in the "audiovj" playlist,
1. Make sure the beatgrid is accurate, including the downbeats.
2. Use hot cue pads to label the start of each phrase. Each letter maps to a phrase type:
  - A = intro
  - B = verse
  - C = buildup
  - D = drop
  - E = breakdown
  - F = outro
  Not all cues are required per track. For example, labeling only A (intro), C (buildup), and D (drop) is fine if the track doesn't have a distinct verse or breakdown. Since each letter can only be used once per track, repeated sections (e.g., a second drop) are left unlabeled — the model is expected to generalize from the labeled instances.
Once done, export the library to an XML file (File > Export Collection in xml format).
Make sure the audio tracks are in a folder that you can point to when importing. If needed, they can also be copied to a new location. The exact folder structure does not matter, the import script will recursively search for audio files, and match them to the tracks in the XML file based on filename. Tip: you can drag and drop tracks from rekordbox into your OS' file explorer to copy them to a new location (only tested on Mac).

2. Import tracks from rekordbox

uv run audiovj import-rekordbox <path-to-library.xml> <path-to-audio-folder>

Only imports tracks in the "audiovj" playlist. Override with --playlist <name>.

3. Preprocess audio

uv run audiovj preprocess

4. Inspect a track

uv run audiovj inspect <track_id>

5. Train

uv run audiovj train [--epochs 50] [--batch-size 8] [--lr 1e-3]

6. Evaluate

uv run audiovj evaluate

7. Predict on a track

uv run audiovj predict-file <track_id>

8. Predict on live audio

uv run audiovj run-live --audio-device <index|name> --audio-channels <ch,ch>

List Devices

uv run audiovj list-devices

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
src/audiovj		src/audiovj
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AudioVJ AI

Goal

🚧 Work In Progress

Pipeline

1. Label tracks in rekordbox

2. Import tracks from rekordbox

3. Preprocess audio

4. Inspect a track

5. Train

6. Evaluate

7. Predict on a track

8. Predict on live audio

List Devices

About

Uh oh!

Releases

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AudioVJ AI

Goal

🚧 Work In Progress

Pipeline

1. Label tracks in rekordbox

2. Import tracks from rekordbox

3. Preprocess audio

4. Inspect a track

5. Train

6. Evaluate

7. Predict on a track

8. Predict on live audio

List Devices

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Contributors

Uh oh!

Languages