You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In this repository I have developed ETL data pipeline using GCP services like GCS, Big Query, Dataproc, Cloud composer(Airflow) and also setting up GCP resources this repository is python 7 sql based, code is parameterized, so values like, path's, credentials are not hardcoded, also demonstrating few patterns used in production like pipelines.
This project analyzes a technology product sales dataset to derive business insights, enhance data analysis skills, & data visualization. Key steps include EDA & Pre-Processing, analysis, and visualization, leading to strategic recommendations for improving sales performance.
This project analyzes 19M+ flight records (sourced from Kaggle in Parquet format) to uncover trends in delays, taxi times, and cancellations—driving insights for improved airline scheduling and operational efficiency.