Customer Churn Prediction Pipeline (Python)

Context

This project simulates a customer analytics use case and demonstrates how machine learning can be used to identify customers with a high churn risk.

It is designed as an end-to-end pipeline that reflects real-world data workflows in a simplified and structured way.

Overview

The pipeline generates synthetic customer data, stores it in a database, trains a machine learning model, predicts churn risk, and provides basic analysis and visualization of the results.

The goal is to demonstrate how business-relevant insights can be derived from data using a clean and modular approach.

Use Case

Typical applications of this type of pipeline include:

identifying customers with high churn risk
supporting retention strategies
prioritizing customer follow-ups
analyzing behavioral patterns

Technologies

Python
pandas
numpy
scikit-learn
SQLite
matplotlib
joblib

Pipeline Steps

Generate synthetic customer data
Store structured data in SQLite
Train a Random Forest classification model
Predict churn risk
Evaluate model performance (accuracy, classification report)
Export predictions to CSV
Save trained model
Analyze customer segments
Visualize churn distribution
Visualize feature importance
Predict churn risk for a new customer

Output

The pipeline generates the following outputs:

data/customer_data.db (database)
data/churn_predictions.csv (predictions)
data/churn_distribution.png (visualization)
data/feature_importance.png (model insights)
models/risk_model.pkl (trained model)

Example Result

The model predicts whether a customer is likely to churn (1) or not (0) based on behavioral and demographic features.

Additional outputs include:

churn rate
average values per risk group
feature importance ranking

Run the project

pip install -r requirements.txt
python main.py

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.venv		.venv
customer-analytics-pipeline		customer-analytics-pipeline
images		images
src		src
.gitignore		.gitignore
README.md		README.md
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Customer Churn Prediction Pipeline (Python)

Context

Overview

Use Case

Technologies

Pipeline Steps

Output

Example Result

Run the project

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Customer Churn Prediction Pipeline (Python)

Context

Overview

Use Case

Technologies

Pipeline Steps

Output

Example Result

Run the project

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages