This document provides the full set of instructions to reproduce our project results from scratch, including data setup, environment configuration, training, and evaluation.
.
├── data/ # Contains raw and processed datasets
├── src/ # All source code (models, training, evaluation)
├── requirements.txt # Python dependencies
├── README.md # README file
├── REPRO.md # This file
├── XXXXX
├── XXXXXSetup project by running the following commands:
# Example -- overwrite if needed
conda create -n XXXXX python=XXXX
conda activate XXXXX
pip install -r requirements.txtPlace your datasets in the XXXX/ directory.
mkdir -p data/example_dataset
cd data/example_dataset
wget xxxxx
python -m src.preprocess_example_dataset.py xxxx
cd ../..Set your parameters in the config file before training. Example:
Run the following command to train the baseline:
python XXXXTo perform inference:
python XXXXAlternatively, execute the following slurm jobs:
sbatch job_scripts/train_xxxxx.job
sbatch job_scripts/infer_xxxxx.jobAfter training, evaluate all models with:
python XXXXThis project repository uses the following frameworks / refers to the following papers:
- XXX
- XXX