Neural Network GPU/CPU Profiling

This repository demonstrates profiling memory and compute usage of different neural network architectures (MLP, CNN, Transformer block) using PyTorch. It is designed to run on Mac MPS (Apple Silicon) or CPU.

Features

Implements three architectures:
- MLP (Multi-Layer Perceptron)
- Simple CNN
- Transformer Block
Generates synthetic data with varying batch sizes
Profiles CPU time and GPU memory usage (via MPS memory allocation)
Saves memory and compute plots to results/ folder

Setup

Clone the repository

git clone https://github.com/giabaow/NN-Profiling.git
cd NN-Profiling

Create a virtual environment and install dependencies

python3 -m venv venv
source venv/bin/activate
pip install torch torchvision matplotlib

Ensure results/ folder exists

mkdir -p results

Usage

Run the profiling script:

python profile.py

The script will:
- Generate synthetic data for each model
- Profile forward passes with increasing batch sizes
- Print per-layer CPU timing
- Print MPS memory usage
- Save plots of memory vs batch size to results/memory_vs_batch.png

File Structure

NN-Profiling/
├─ models.py        # MLP, CNN, TransformerBlock implementations
├─ data.py          # Synthetic data generator
├─ profile.py       # Profiling script (CPU/MPS + memory)
├─ results/         # Saved memory/compute plots
└─ README.md

Output

Memory Usage Plot:

Layer	Self CPU Time (ms)
Linear	0.807
ReLU	2.630
MultiheadAttn	4.780

Actual numbers will vary depending on batch size and machine.

Notes Mac MPS Limitation: PyTorch profiler currently does not support MPS activity directly, so profiling only tracks CPU time. GPU memory is tracked with torch.mps.memory_allocated() and torch.mps.max_memory_allocated().

Adjust batch size: Modify batch_sizes in profile.py to experiment with different loads. Extending to CUDA: On NVIDIA GPUs, change the device to cuda and set profiler activities to [ProfilerActivity.CPU, ProfilerActivity.CUDA].

License

This project is MIT Licensed. Feel free to use and modify for research or learning purposes.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Neural Network GPU/CPU Profiling

Features

Setup

Usage

File Structure

Output

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
__pycache__		__pycache__
results		results
README.md		README.md
data.py		data.py
models.py		models.py
profile.py		profile.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

Neural Network GPU/CPU Profiling

Features

Setup

Usage

File Structure

Output

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages