Skip to content
View Wxssxm's full-sized avatar

Block or report Wxssxm

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. scraper-watchdog scraper-watchdog Public

    Python

  2. street-food-intel street-food-intel Public

    Python

  3. nyc-taxi-analytics nyc-taxi-analytics Public

    High-performance SQL analytics on NYC TLC Yellow Taxi parquet files using DuckDB, no warehouse needed.

    Python

  4. hacker-news-data-lake hacker-news-data-lake Public

    Bronze/Silver/Gold data lake on the Hacker News Firebase API, orchestrated by Airflow with MinIO and partitioned Parquet. Async httpx ingestion, ExternalTaskSensor-gated DAG dependencies, DuckDB-bu…

    Python

  5. kafka-streaming-pipeline kafka-streaming-pipeline Public

    Real-time e-commerce events: Python producer -> Kafka KRaft -> Spark Structured Streaming -> Postgres -> Streamlit dashboard. Windowed aggregations, anomaly detection, full docker-compose stack.

    Python

  6. open-data-etl open-data-etl Public

    Batch ETL pipeline ingesting French open-data DVF (real-estate transactions) into a Parquet star schema with DuckDB views. Polars streaming, idempotent download, partitioned warehouse.

    Python