Skip to content
View murillo-ro-silva's full-sized avatar
🤚
Just Working!
🤚
Just Working!

Block or report murillo-ro-silva

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
murillo-ro-silva/README.md

Hey, I'm Murillo 👋

Data Engineering Lead @ Branching Minds and Founder of Datoga.io.
I design and lead modern data platforms with Databricks, Spark, dbt, Dagster, Airbyte—turning messy data into reliable, governed, and cost-efficient pipelines.

Now

  • Leading a Data Engineering team building a new data architecture (files from multiple US school districts → reliable datasets powering apps and analytics).
  • Running Datoga.io, a consultancy focused on Modern Data Stack, Data Quality/Governance, and AI-assisted automation.

What I’m good at

  • Modern Data Platforms: lakehouse design, ingestion → bronze/silver/gold, SCD/Snapshots, cost/perf tuning.
  • Data Quality & Governance: contracts, tests, lineage, SLAs, access with Unity Catalog.
  • Orchestration & Ops: Dagster jobs, event-driven pipelines, CI/CD for data, observability and alerts.
  • Enablement: standards, templates, and docs that help teams ship faster with fewer regressions.

Selected outcomes

  • Standardized ingestion across vendors and districts, reducing ad-hoc work and incidents.
  • Introduced snapshot/SCD patterns to cut unnecessary reprocessing and improve downstream reliability.
  • Evolved data quality checks and metadata to support governance and safer product releases.
  • Drove a practical migration path toward a scalable lakehouse with clear ownership.

Core stack

Databricks • Spark • dbt • Dagster • Airbyte • SQL • Python • Delta Lake • AWS (S3, Glue, Lambda, Athena)
Also used: Metabase, GitHub Actions, Terraform, APIs, Crawlers, GCP (BigQuery/PubSub) when needed.

Availability

I’m not actively job-seeking, but I’m open to selective conversations or short-term advisory/consulting that align with building resilient data platforms at scale.


Contact

LinkedInmurillo@datoga.io

Popular repositories Loading

  1. murillo-ro-silva murillo-ro-silva Public

    2

  2. egkatzioura.wordpress.com egkatzioura.wordpress.com Public

    Forked from gkatzioura/egkatzioura.wordpress.com

    Project from blog posts on https://egkatzioura.wordpress.com/

    JavaScript

  3. bililiufather bililiufather Public

    Java Portifolio - Project created for consumer sqs queue and invoke endpoint.

    Java

  4. birdie birdie Public

    Jupyter Notebook

  5. challenge challenge Public

    Forked from Creditas/challenge

    Team recruiting challenges

    JavaScript

  6. pyspark_test1 pyspark_test1 Public

    Jupyter Notebook