Skip to content

pgoyal77/ML-Driven-Biomarker-Identification-for-Early-Disease-Detection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

ML-Driven Biomarker Identification for Early Disease Detection

This repository contains the work for my Major Technical Project-1 during 4th Year B.Tech, which explores the use of machine learning for the identification of potential biomarkers based on patient metabolite data. The goal is to aid in early disease diagnosis by extracting biologically significant features from large-scale sample datasets.


Problem Statement

To develop a machine learning model that can classify metabolite data from patients and identify a small subset of biomarkers that are predictive of disease occurrence.


Project Highlights

  • Analyzed 100 patient samples (cases + controls) for metabolite profiling.
  • Used MetaboAnalyst for:
    • Data normalization
    • Preprocessing
    • PCA visualization
  • Applied ML models:
    • Support Vector Machine (SVM)
    • XGBoost
    • Random Forest
  • Achieved 92% data similarity when validated against known datasets.

Tools & Technologies

  • Python (pandas, scikit-learn, xgboost, seaborn, matplotlib)
  • MetaboAnalyst (Web-based metabolomic data analysis)
  • Jupyter Notebook
  • PDF report making for methodology and results

Learnings

  • Gained hands-on experience with biomarker discovery pipelines.
  • Learned feature selection and model comparison in bioinformatics.
  • Practiced end-to-end machine learning workflows with biomedical data.
  • Validated findings using both biological context and model outputs.

About

This project focuses on biomarker identification using Machine Learning (ML) for early disease detection. The workflow included data preprocessing using MetaboAnalyst, followed by training and evaluation of multiple ML models like SVM, XGBoost, and Random Forest.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors