Particle Imaging Models (pimm)

Foundation model research for particle imaging detectors

A codebase for perception research for time projection chambers (TPCs), with a focus on liquid argon TPCs, built on the Pointcept training and inference framework.

This repository currently deals with 3D charge clouds only, with plans to incorporate 2D images (e.g., wireplane waveforms) and other modalities in the near future.

Overview

pimm adapts methods in deep learning and computer vision for event reconstruction in LArTPC detectors. This repository provides:

  • Self-supervised pre-training: discriminative pre-training (Sonata) for learning good representations of LArTPC images.
  • Panoptic segmentation: PointGroup and Panda Detector models for particle- and interaction-level instance/semantic segmentation.
  • Semantic segmentation: models for per-pixel classification.

In sum, pimm integrates the following works:

  • Backbones: MinkUNet, SpUNet (see SparseUNet), PTv1, PTv2, PTv3 (see Point Transformers)
  • Instance segmentation: PointGroup (see PointGroup), Panda Detector (see Panda Detector)
  • Pre-training: Sonata (see Sonata), PoLAr-MAE (see PoLAr-MAE)
  • Datasets: PILArNet-M (see PILArNet-M)

TODO

We are looking at including the following models/modalities in the future:

  • SPINE, up until postprocessing module
  • PoLAr-MAE pre-training and fine-tuning
  • 2D TPC waveforms/networks, e.g., NuGraph
  • Optical waveforms

Quick Start

Using the container (recommended)

git clone https://github.com/DeepLearnPhysics/particle-imaging-models.git
cd particle-imaging-models
apptainer pull /path/to/pimm.sif docker://youngsm/pimm:pytorch2.5.0-cuda12.4

Train (single GPU):

apptainer exec --nv --bind XXX /path/to/pimm.sif \
  sh scripts/train.sh -g 1 -d panda/pretrain -c pretrain-sonata-v1m1-pilarnet-smallmask

where XXX is a comma-separated list of directories (outside your home directory) that pimm needs to see inside the container. This is only needed on HPC systems with non-standard directory layouts. For example, on SLAC National Accelerator Laboratory's S3DF cluster, you must pass --bind /sdf,/lscratch.

Multi-GPU:

Change -g 1 (1 GPU) to -g 4 (4 GPUs), or omit -g to use all available GPUs.

Multi-Node:

For training on Slurm configurations, you can use the multinode.slurm.sbatch file in scripts/slurm/ to submit your sbatch job.

To get started, adjust the number of nodes and GPUs per node:

#SBATCH --ntasks-per-node=4
#SBATCH --nodes=2

Then set -m to the number of nodes and -g to the number of tasks (i.e., GPUs) per node: -m 2 -g 4

See Dataset Preparation to download PILArNet-M.

From source

git clone https://github.com/youngsm/particle-imaging-models.git
cd particle-imaging-models
conda env create -f environment.yml
conda activate pimm-torch2.5.0-cu12.4
sh scripts/train.sh -g 1 -d panda/pretrain -c pretrain-sonata-v1m1-pilarnet-smallmask

Requires CUDA 11.6+ for FlashAttention (set enable_flash=False in configs if unavailable).

Multi-Node Training

SLURM templates are provided for multi-node training:

cp scripts/slurm/multinode.slurm.sbatch my_job.sh   # generic / SLAC cluster
cp scripts/slurm/multinode.nersc.sbatch my_job.sh   # NERSC Perlmutter
# edit SBATCH headers + experiment section, then:
sbatch my_job.sh

The key rule: --ntasks-per-node must equal the number of GPUs per node. The training script handles distributed setup automatically via SLURM environment variables.
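As a sanity check, the rank arithmetic derived from those SLURM variables can be sketched as follows. This is illustrative only; the actual setup lives in scripts/train.sh and the trainer, and the function name here is made up.

```python
import os

def slurm_dist_info(env):
    """Derive distributed-training ranks from SLURM environment variables.

    Sketch of the arithmetic only; pimm's launcher may differ in detail.
    """
    nnodes = int(env["SLURM_NNODES"])
    gpus_per_node = int(env["SLURM_NTASKS_PER_NODE"])  # must equal GPUs per node
    node_id = int(env["SLURM_NODEID"])
    local_rank = int(env["SLURM_LOCALID"])
    return {
        "world_size": nnodes * gpus_per_node,
        "rank": node_id * gpus_per_node + local_rank,  # global rank of this task
        "local_rank": local_rank,                      # GPU index on this node
    }

# Example: 2 nodes x 4 GPUs, second task on the second node
info = slurm_dist_info({
    "SLURM_NNODES": "2",
    "SLURM_NTASKS_PER_NODE": "4",
    "SLURM_NODEID": "1",
    "SLURM_LOCALID": "1",
})
print(info)  # {'world_size': 8, 'rank': 5, 'local_rank': 1}
```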

Training & Testing

The entry point is scripts/train.sh:

sh scripts/train.sh -d <dataset> -c <config> [options]
Flag      Description
-d        Config directory (e.g., panda/pretrain, panda/semseg)
-c        Config name without .py
-n        Experiment name (default: auto-generated)
-g        GPUs per machine (default: all available)
-m        Number of machines (default: 1)
-w        Path to checkpoint (to be used by CheckpointLoader)
-r true   Resume training from last checkpoint
-C        Dev mode: skip code snapshot, run from repo source
-h        Show full help
# Override config values
sh scripts/train.sh -d panda/pretrain -c pretrain-sonata-v1m1-pilarnet-smallmask \
  -- --options epoch=10 data.train.max_len=1000

# Fine-tune from pre-trained checkpoint
sh scripts/train.sh -g 4 -d panda/semseg -c semseg-pt-v3m2-pilarnet-ft-5cls-lin \
  -w /path/to/checkpoint.pth

# Resume
sh scripts/train.sh -d panda/pretrain -c pretrain-sonata-v1m1-pilarnet-smallmask \
  -n my_experiment -r true

See Config Structure for more on how configs work.

By default, train.sh snapshots the codebase into exp/<dataset>/<name>/code/ and runs the code from this snapshot for reproducibility. Use -C to skip this during development.

Model checkpoints, which can be quite large, are saved to exp/<dataset>/<name>/model/. To redirect to a separate disk, set MODEL_DIR in your .env file or environment; this will save the checkpoint to MODEL_DIR and symlink it to exp/<dataset>/<name>/model.
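The redirect-and-symlink behavior can be sketched like this; redirect_model_dir is a hypothetical helper for illustration, not pimm's actual code.

```python
import os
import tempfile

def redirect_model_dir(exp_model_path, model_dir):
    """Store checkpoints under model_dir and symlink the experiment's
    model/ path to it, so exp/<dataset>/<name>/model still resolves."""
    os.makedirs(model_dir, exist_ok=True)
    if not os.path.lexists(exp_model_path):
        os.symlink(model_dir, exp_model_path)
    return os.path.realpath(exp_model_path)

# Toy layout: experiment tree on one disk, MODEL_DIR on another.
root = tempfile.mkdtemp()
exp = os.path.join(root, "exp", "panda", "run1")
os.makedirs(exp)
target = redirect_model_dir(
    os.path.join(exp, "model"),       # where the trainer writes checkpoints
    os.path.join(root, "big_disk"),   # stand-in for MODEL_DIR
)
```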

Configuration System

Configurations are Python dictionary-based files located in the configs/ directory. Each config file defines the model architecture, dataset settings, training hyperparameters, and different hooks to run during training (checkpoint saving, logging, evaluation).

Config Structure

Configs use a hierarchical structure with _base_ inheritance:

_base_ = ["../../_base_/default_runtime.py"]

# Override or add settings
model = dict(type="PT-v3m2", ...)
data = dict(train=dict(...), val=dict(...))

Modifying Configs

You can modify configs in two ways:

  1. Edit the config file directly
  2. Override via command line using --options:
    sh scripts/train.sh ... -- --options epoch=50 data.train.max_len=500000
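Conceptually, each key=value pair in --options is a dotted path into the nested config dict. A minimal sketch of that merge (Pointcept's real parser also handles type coercion and more):

```python
def apply_options(cfg, options):
    """Apply dotted-key overrides (as from --options) to a nested dict.

    Minimal sketch of the idea; not the actual Pointcept implementation.
    """
    for key, value in options.items():
        node = cfg
        *parents, leaf = key.split(".")
        for p in parents:
            node = node.setdefault(p, {})  # walk/create intermediate dicts
        node[leaf] = value
    return cfg

cfg = {"epoch": 100, "data": {"train": {"max_len": None}}}
apply_options(cfg, {"epoch": 50, "data.train.max_len": 500000})
```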

Example configs can be found in:

  • configs/panda/pretrain/ - Pre-training configurations
  • configs/panda/semseg/ - Semantic segmentation configurations
  • configs/panda/panseg/ - Panoptic segmentation configurations

Dataset Preparation

PILArNet-M

Download the 168GB dataset from Hugging Face:

python tools/download_pilarnet.py --version v2 --output_dir /path/to/dir

Data is saved to ~/.cache/pimm/pilarnet/v2 if --output_dir is not provided. After downloading the dataset, run cp example.env .env and set PILARNET_DATA_ROOT_V2; this lets the dataloader find the data automatically.
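The lookup order described above amounts to: environment (or .env) first, then the default cache location. A sketch with a hypothetical helper name:

```python
import os

def resolve_pilarnet_root(version="v2"):
    """Resolve the PILArNet data root: prefer PILARNET_DATA_ROOT_<VERSION>
    from the environment (populated from .env), else fall back to the
    default cache directory. Illustrative helper, not pimm's actual API."""
    env_key = f"PILARNET_DATA_ROOT_{version.upper()}"
    default = os.path.expanduser(f"~/.cache/pimm/pilarnet/{version}")
    return os.environ.get(env_key, default)

os.environ["PILARNET_DATA_ROOT_V2"] = "/data/pilarnet/v2"
root = resolve_pilarnet_root("v2")  # -> "/data/pilarnet/v2"
```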

PILArNet has two revisions. v2 is recommended for new models (it adds PID, momentum, and vertex information). v1 is the original dataset from the PoLAr-MAE paper. Events differ between the two versions, so models trained on v1 should also be evaluated on v1.

Data Format

Point cloud data should be organized with the following structure:

{
    'coord': (N, 3),           # 3D hit positions [x, y, z]
    'feat': (N, C),            # Hit features (charge, time, etc.)
    'segment': (N, 1),         # Semantic labels (optional, for training)
    'instance': (N, 1),        # Instance IDs (optional, for training)
    ...                        # Extra attributes
}

The data often needs to be rescaled to a domain that trains more efficiently (e.g., centering and scaling coordinates to [-1, 1]^3). This can be done within the Dataset class or from a Transform. See the transform sections of the configuration files for more details.
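A minimal sketch of that centering/scaling step, applied to a toy event in the schema above (the detector extent and feature values are made up):

```python
import numpy as np

rng = np.random.default_rng(0)
event = {
    "coord": rng.uniform(0.0, 768.0, size=(128, 3)).astype(np.float32),
    "feat": rng.uniform(size=(128, 2)).astype(np.float32),
}

def normalize_coords(coord):
    """Center the cloud and divide by its largest half-extent,
    mapping all coordinates into [-1, 1]^3."""
    center = (coord.min(axis=0) + coord.max(axis=0)) / 2.0
    half_extent = (coord.max(axis=0) - coord.min(axis=0)).max() / 2.0
    return (coord - center) / half_extent

event["coord"] = normalize_coords(event["coord"])
```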

Packed Data Format

This library works with packed data, where all batched quantities are in two dimensions instead of three, i.e. (N, 3) instead of (B, N, 3). This is because point clouds are variable length, and getting to a 3 dimensional tensor would require padding. Instead of padding, there is an offset tensor, which is of length B and gives the indices in the packed tensors at which a point cloud ends and a new one starts.

Offset is conceptually similar to the concept of Batch in PyG, and can be seen as the cumulative sum of a lengths tensor. A visual illustration of batch and offset is as follows:
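In code, offset, lengths, and a PyG-style batch vector are interconvertible. A small NumPy sketch (pimm itself works with PyTorch tensors, but the arithmetic is identical):

```python
import numpy as np

# Three clouds of 4, 2, and 3 points packed into one (N, 3) array.
lengths = np.array([4, 2, 3])
coords = np.random.randn(int(lengths.sum()), 3)

offset = np.cumsum(lengths)                       # array([4, 6, 9])
# PyG-style per-point batch index, recovered from lengths:
batch = np.repeat(np.arange(len(lengths)), lengths)
# -> [0 0 0 0 1 1 2 2 2]

# Slice cloud i back out of the packed array:
starts = np.concatenate([[0], offset[:-1]])       # [0, 4, 6]
cloud_1 = coords[starts[1]:offset[1]]             # shape (2, 3)
```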

[Figure from Pointcept: visual illustration of batch vs. offset.]

Docker / Apptainer

Pre-built images are available on Docker Hub:

Image                                       Description
youngsm/pimm:pytorch2.5.0-cuda12.4          Standard image
youngsm/pimm-nersc:pytorch2.5.0-cuda12.4    NERSC variant with extra dependencies

apptainer pull /path/to/pimm.sif docker://youngsm/pimm:pytorch2.5.0-cuda12.4

Model Zoo

Model Versioning

Models use vXmY naming (version X, mode Y). Different modes indicate small architecture variants, while versions indicate large architectural changes.

Backbones

  • PTv3 (Point Transformer V3) — efficient backbone. Requires spconv; FlashAttention is optional and needs CUDA 11.6+.
  • SparseUNet — SpConv-based UNet.
  • PTv2, PTv1 — earlier Point Transformer versions.

Pre-training

  • Panda/Sonata — DINO-style self-supervised learning with teacher-student framework and online prototype clustering.
  • PoLAr-MAE — masked autoencoder with chamfer + energy reconstruction losses.

Instance / Panoptic Segmentation

  • PointGroup — clustering-based instance segmentation.
  • Panda Detector — Mask2Former-style detection modified to take low energy deposits into account.

Logging

Both TensorBoard and Weights & Biases are enabled by default. Set use_wandb=False to disable W&B. To authenticate, either run wandb login or add WANDB_API_KEY=your_key to your .env file (see example.env).

hooks = [
    dict(type="WandbNamer", keys=("model.type", "data.train.max_len", "amp_dtype", "seed")),
    ...
]

Acknowledgements

Built on Pointcept. Thanks to them!

License

MIT (inherited from Pointcept).
