Repository tracking progress on the AI in Medical Imaging Diagnostics project for spine segmentation.
- Python 3.9.21 (recommended)
- Create a virtual environment:

  ```bash
  python3.9 -m venv .venv
  source .venv/bin/activate  # On Windows: .venv\Scripts\activate
  ```

- Install dependencies:

  ```bash
  pip install -r requirements.txt
  ```
- Download datasets from the sources listed below and place them in the directories specified in `config.json`, or modify the config itself.
- Run the training pipeline:

  ```bash
  ./cli.py
  ```

If configured properly, `cli.py` will:
- Preprocess the datasets
- Create dataloaders with train/val/test splits
- Start training the model
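The three steps above can be sketched as a minimal pipeline. All function names and the 70/15/15 split below are illustrative assumptions, not the project's actual API (the real wiring lives in the `src/` modules):

```python
def preprocess(config):
    # Stand-in for the real dataset preprocessing step.
    return list(range(10))

def make_dataloaders(data):
    # Example 70/15/15 train/val/test split (int() truncates, so small
    # datasets will round unevenly); real ratios come from the config.
    n = len(data)
    return {
        "train": data[: int(0.7 * n)],
        "val": data[int(0.7 * n): int(0.85 * n)],
        "test": data[int(0.85 * n):],
    }

def train(loaders, config):
    print(f"training on {len(loaders['train'])} samples")

def run_pipeline(config):
    data = preprocess(config)          # 1. preprocess the datasets
    loaders = make_dataloaders(data)   # 2. create train/val/test splits
    train(loaders, config)             # 3. start training the model

run_pipeline({})
```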
You can specify a custom config file using `--config`:

```bash
./cli.py --config my_config.json
```

It is recommended to place new configs for separate experiments in the `experiments/` directory, in an appropriately named subdirectory.
Preprocessing is executed automatically if the `preprocessed_data_dir` in `Config` does not exist and the data sources in `PreprocessingConfig` are available. So if you want to create another version of the preprocessed data, modify the preprocessing code and run `cli.py` with a different `preprocessed_data_dir`.
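The existence check that gates automatic preprocessing can be sketched as follows. `DummyConfig` and `needs_preprocessing` are hypothetical names for illustration; the real logic lives in the project's preprocessing code:

```python
from dataclasses import dataclass
from pathlib import Path

# Hypothetical stand-in for the project's Config; only the one field
# relevant to the check is modeled here.
@dataclass
class DummyConfig:
    preprocessed_data_dir: str

def needs_preprocessing(config: DummyConfig) -> bool:
    """Preprocessing is triggered only when the output directory is absent."""
    return not Path(config.preprocessed_data_dir).exists()
```

Pointing `preprocessed_data_dir` at a fresh path therefore forces a new preprocessing run, while an existing directory is reused as-is.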
The project follows a modular structure with components in the src/ directory:
- `config.py`: Configuration management using dataclasses
- `preprocessing.py`: Data preprocessing pipeline
- `dataloader.py`: PyTorch dataloaders with data augmentation
- `model.py`: Model factory and architecture definitions
- `train.py`: Training loop and validation
- `loss.py`, `metrics.py`, `optimizer.py`, `scheduler.py`: Training components
The project uses a single Config object (from src.config) that is dependency-injected throughout the codebase. This Config object contains all hyperparameters, paths, and settings organized into nested dataclasses:
- `TrainingConfig`: Training hyperparameters (learning rate, epochs, scheduler, etc.)
- `PreprocessingConfig`: Data preprocessing settings
- `DataLoaderConfig`: Dataloader configuration and augmentation parameters
- `ModelConfig`: Model architecture parameters
- `WandBConfig`: Weights & Biases logging configuration
All components receive the same Config instance, ensuring consistency and making it easy to manage experiments through JSON configuration files.
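As a rough sketch of this pattern, a nested-dataclass config with JSON loading might look like the following. The field names and `from_json` helper are assumptions for illustration; the actual schema is defined in `src/config.py`:

```python
import json
from dataclasses import asdict, dataclass, field

# Trimmed-down, hypothetical versions of two of the nested dataclasses.
@dataclass
class TrainingConfig:
    learning_rate: float = 1e-3
    epochs: int = 100

@dataclass
class ModelConfig:
    architecture: str = "unet"
    in_channels: int = 1

@dataclass
class Config:
    training: TrainingConfig = field(default_factory=TrainingConfig)
    model: ModelConfig = field(default_factory=ModelConfig)

    @classmethod
    def from_json(cls, path: str) -> "Config":
        """Load an experiment JSON file, falling back to dataclass defaults."""
        with open(path) as f:
            raw = json.load(f)
        return cls(
            training=TrainingConfig(**raw.get("training", {})),
            model=ModelConfig(**raw.get("model", {})),
        )
```

An experiment JSON file then only needs to override the fields it changes; everything else keeps its default, and `asdict(Config())` yields the full default settings as a plain dict.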
After changing the `Config` structure, run `python -m src.config` to regenerate `default_config.json`.
After making changes to `dataloader.py`, you can run `python -m src.dataloader` to test it; the same applies to `preprocessing.py`.
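The `python -m src.<module>` convention works because each module can end with a small `__main__` self-test block. A minimal sketch of the idea (with `build_dataloader` as a hypothetical stand-in for the real PyTorch `DataLoader` setup):

```python
def build_dataloader(batch_size: int = 4):
    # Stand-in for the real dataloader: batches a toy dataset of 12 samples.
    data = list(range(12))
    return [data[i:i + batch_size] for i in range(0, len(data), batch_size)]

if __name__ == "__main__":
    # Executed only via `python -m ...`, never on import from the pipeline.
    batches = build_dataloader()
    assert len(batches) == 3 and len(batches[0]) == 4
    print("dataloader smoke test passed")
```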
For hyperparameter optimization using Weights & Biases, see WANDB_SWEEP_README.md.
Exploratory data analysis was done in `EDA/datasets_eda.ipynb`.
Dataset sources: