Skip to content
#

class-imbalance-handling

Here are 16 public repositories matching this topic...

(WIP): 'Aporia' in Greek means 'inconsistent'. A Python library that detects and fixes dataset issues using both rule-based methods and ML models. It evaluates dataset quality across multiple metrics, including missing values, duplicates, outliers, class imbalance, and label consistency. It also suggests fixes based on the metric scores.

  • Updated Mar 28, 2025
  • Jupyter Notebook

Predicting company bankruptcy using various machine learning models. The dataset is sourced from Kaggle: Company Bankruptcy Prediction.

  • Updated Aug 7, 2024
  • Python

This project focuses on detecting fraudulent credit card transactions using Machine Learning and Data Analytics. It applies advanced techniques such as EDA (Exploratory Data Analysis), feature engineering, and imbalance handling (SMOTE, undersampling) to improve fraud detection accuracy.

  • Updated Aug 11, 2025
  • Jupyter Notebook

End-to-end machine learning workflow on the Combined Cycle Power Plant dataset: data cleaning, EDA, outlier removal, feature engineering, class balancing, and model evaluation for regression and classification. Includes code, visualizations and best practices in a single Jupyter notebook.

  • Updated Feb 12, 2026
  • Jupyter Notebook

A binary classification task performed with machine learning in Python. The dataset's target distribution is heavily imbalanced. The model performance was evaluated with F1 scores.

  • Updated Dec 2, 2024
  • Jupyter Notebook

Developed an ensemble ML classification model to predict U.S. visa case outcomes (Certified vs Denied) using applicant and employer attributes. Performed EDA, sampling, and model tuning (Random Forest, Gradient Boosting, XGBoost) to improve decision efficiency and identify key policy drivers like education, experience, and wage trends.

  • Updated Mar 31, 2026
  • Jupyter Notebook

A dual-part finance and retail analytics project covering credit default prediction for companies using machine learning (Logistic Regression & Random Forest) and market risk analysis of a five-stock Indian equity portfolio using historical price and return data.

  • Updated Apr 1, 2026
  • Jupyter Notebook

This is a production-ready, end-to-end system developed to detect and classify racist tweets using advanced Natural Language Processing (NLP) techniques. Built on top of BERTweet (vinai/bertweet-base) and fine-tuned with a robust, k-fold cross-validation training pipeline, powered by streamlit UI!

  • Updated Jul 23, 2025
  • Python

Improve this page

Add a description, image, and links to the class-imbalance-handling topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the class-imbalance-handling topic, visit your repo's landing page and select "manage topics."

Learn more