Skip to content
View ankitasaha34's full-sized avatar

Block or report ankitasaha34

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please donโ€™t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
ankitasaha34/README.md

Hi, I'm Ankita Saha ๐Ÿ‘‹

Data Analytics & BI Professional | MS Information Management @ UIUC | Previously Amazon & LTIMindtree

F-1 OPT STEM OPT Open to Work


About Me

I build production-grade data pipelines, BI dashboards, and AI-powered analytics tools. My background spans compliance analytics at Citibank (via LTIMindtree), audit BI at Amazon, and a Master's in Information Management at UIUC with a GPA of 3.95.

Over the past year I have been learning AI and LLM APIs from scratch and built three live projects to show what I learned.


๐Ÿค– AI Projects

Project What It Does Stack GitHub
AI Content Safety Classifier Dual-signal content moderation combining ML and Claude API. 86% recall, 49% ML vs LLM disagreement rate Python, scikit-learn, Claude API, Streamlit GitHub
T&S Ops Monitoring Dashboard 90-day ops dashboard with attack detection (67% recall) and Claude-generated weekly summaries Python, Streamlit, Plotly, Claude API GitHub
T&S Analyst Copilot NL-to-SQL copilot with agentic retry loop, data governance guardrails, and chart auto-detection Python, DuckDB, Claude API, Streamlit GitHub

๐Ÿ“Š Other Projects

Project Domain Stack
Healthcare Claims Analytics Pipeline Data Engineering, ML Python, SQL, scikit-learn, SMOTE
Retail Analytics Dashboard BI, Cloud Data Warehouse Azure, Power BI, DAX, Azure Data Factory
Car Insurance Claims on AWS Cloud ML, Risk Modeling AWS SageMaker, S3, Python, SHAP
Breast Cancer ML Classifier Evaluation ML Research Python, scikit-learn, GridSearchCV
Fast Fashion Supply Chain Optimization Analytics, Regression R, ggplot2, dplyr

๐Ÿ› ๏ธ Skills

Languages and Query SQL Python R DAX

BI and Visualization Power BI Tableau QuickSight Streamlit Plotly Advanced Excel

Cloud and Platforms Snowflake AWS Redshift AWS SageMaker

Data Engineering and ML ETL/ELT Pipelines Dimensional Modeling Star Schema Data Quality scikit-learn SHAP Anomaly Detection

Methods and Tools Agile Jira Alteryx Git/GitHub Stakeholder Management


๐Ÿ“œ Certifications

Azure AWS AWS AI


๐Ÿ“ˆ Experience

Amazon โ€” Business Intelligence Engineer Intern, Internal Audit (Summer 2025)

LTIMindtree โ€” Software Engineer, BI and Compliance Analytics at Citibank (2022 to 2024)

UIUC โ€” MS Information Management, GPA 3.95 (2024 to 2026)


๐ŸŒ Links

Portfolio LinkedIn Email

Popular repositories Loading

  1. DSCFest-Hacktoberfest DSCFest-Hacktoberfest Public

    Forked from Parth-tech/DSCFest-Hacktoberfest

    Hacktoberfestยฎ is open to everyone in our global community.

  2. IRIS-Prediction-using-unsupervised-ML IRIS-Prediction-using-unsupervised-ML Public

    Used jupyter notebook to perform prediction using unsupervised learning on the IRIS dataset & predict the optimum number of clusters and represent it visually

    Jupyter Notebook

  3. MyShoppingWebsite MyShoppingWebsite Public

    Shopping website made using Django

    HTML

  4. Retail_Analytics_A_Power_BI_Solution_Using_Azure_Cloud Retail_Analytics_A_Power_BI_Solution_Using_Azure_Cloud Public

  5. Fast-Fashion-Supply-Chain Fast-Fashion-Supply-Chain Public

  6. Comparative_Evaluation_of_ML_Classifiers_for_Breast_Cancer_Diagnosis Comparative_Evaluation_of_ML_Classifiers_for_Breast_Cancer_Diagnosis Public

    Machine learning project that compares multiple classification models to identify the most reliable approach for breast cancer diagnosis.

    Jupyter Notebook