nagaraju-12/Bank-customer-pyspark-project

About

A complete mini-project demonstrating how to process, clean, and analyze 100,000 synthetic bank transaction records using PySpark in Databricks. It covers real-world data engineering tasks such as data ingestion, null handling, feature engineering, transaction grouping, and business-level reporting. The output is stored in Parquet format, ready for use in BI tools.
