You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A complete mini-project demonstrating how to process, clean, and analyze 100,000 synthetic bank transaction records using PySpark in Databricks. It includes real-world data engineering tasks like data ingestion, null handling, feature engineering, transaction grouping, and business-level reporting, with output stored in Parquet format for BI-ready.