owgreen-dev/README.md

OG Green

Data scientist and builder working on public-data fraud analytics, program-integrity tools, and AI-assisted developer workflows.

I build applied ML/data systems and small developer-focused products — especially tools that turn messy real-world data or repetitive engineering workflows into something clearer, faster, and easier to act on.

My public work is split across a few areas:

  • Fraud / program-integrity analytics
  • Public-data investigation tooling
  • LLM-assisted workflows
  • Developer utilities and API tooling
  • Lightweight SaaS/product experiments

Featured project

Fraud-lead research pipeline over 11.4M public PPP loans, validated against real enforcement outcomes from DOJ/SBA-OIG sources.

This project focuses on turning public records into transparent, defensible investigative leads — not accusations.

Highlights:

  • Built a local analytical warehouse from public SBA/DOJ data
  • Used anomaly detection and ML experiments to rank potentially suspicious loans
  • Added positive-unlabeled learning and LightGBM experiments
  • Benchmarked signals against known prosecuted cases
  • Included bootstrap confidence intervals and clear model limitations
  • Built Streamlit views for analyst-style review
  • Added LLM-assisted entity resolution, retrieval, and similar-case workflows
  • Framed outputs responsibly as statistical leads, not proof of fraud

Stack: Python · DuckDB · pandas · scikit-learn · LightGBM · Streamlit · LLM workflows · graph/retrieval methods
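As an illustration of the benchmarking and bootstrap steps listed above, here is a minimal sketch of a percentile bootstrap interval for precision@k. It is not the project's actual code: the function names, the choice of precision@k as the metric, and the synthetic data are all assumptions made for the example. Because unprosecuted loans are unlabeled rather than confirmed clean, a figure like this is best read as a conservative estimate.

```python
import numpy as np

def precision_at_k(scores, labels, k):
    """Share of the k highest-scored loans that are known prosecuted cases."""
    top = np.argsort(scores)[::-1][:k]
    return labels[top].mean()

def bootstrap_ci(scores, labels, k, n_boot=1000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for precision@k (hypothetical helper)."""
    rng = np.random.default_rng(seed)
    n = len(scores)
    stats = np.empty(n_boot)
    for b in range(n_boot):
        # Resample loans with replacement, then re-rank within the resample.
        idx = rng.integers(0, n, size=n)
        stats[b] = precision_at_k(scores[idx], labels[idx], k)
    lo, hi = np.quantile(stats, [alpha / 2, 1 - alpha / 2])
    return precision_at_k(scores, labels, k), lo, hi

# Synthetic stand-in data: about 2% "known cases" with mildly elevated scores.
rng = np.random.default_rng(42)
labels = (rng.random(5000) < 0.02).astype(int)
scores = rng.normal(size=5000) + 1.5 * labels

point, lo, hi = bootstrap_ci(scores, labels, k=100)
print(f"precision@100 = {point:.3f} (95% CI {lo:.3f} to {hi:.3f})")
```

Reporting the interval alongside the point estimate makes it harder to over-read a ranking that only looks good on one particular sample.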

Organizations / workstreams

I use GitHub organizations to separate different kinds of work instead of mixing every project into one personal account.

Developer tooling for app localization and translation workflows.

Shipi18n is focused on making internationalization easier for developers: translating locale files, preserving placeholders, supporting JSON/i18n workflows, and integrating translation into existing build or automation pipelines.

Examples of work in this area:

  • Translation APIs for developers
  • CLI tooling for locale-file translation
  • Vite/plugin-style integrations
  • Demo repos and framework examples
  • Automation around i18n files, placeholders, and translation memory
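To make the placeholder-preservation idea concrete, here is a small sketch of a check that could run after a locale file is translated. It is an assumption-laden example, not Shipi18n's implementation: the placeholder syntaxes covered (`{{name}}`, `{name}`, `%s`, `%d`), the flat key/value locale shape, and the `find_mismatches` helper are all hypothetical.

```python
import re

# Assumed placeholder styles: {{name}}, {name}, %s, %d.
PLACEHOLDER = re.compile(r"\{\{\s*\w+\s*\}\}|\{\w+\}|%[sd]")

def placeholders(text):
    """Return the sorted placeholders in a string, ignoring inner whitespace."""
    return sorted(re.sub(r"\s+", "", m) for m in PLACEHOLDER.findall(text))

def find_mismatches(source, translated):
    """Compare two flat locale dicts and report keys that need attention."""
    problems = {}
    for key, src in source.items():
        dst = translated.get(key)
        if dst is None:
            problems[key] = "missing"
        elif placeholders(src) != placeholders(dst):
            problems[key] = "placeholders differ"
    return problems

source = {
    "greeting": "Hello, {{name}}!",
    "count": "You have {count} items",
    "bye": "Goodbye",
}
translated = {
    "greeting": "Bonjour, {{ name }} !",
    "count": "Vous avez des articles",  # the {count} placeholder was lost
}

print(find_mismatches(source, translated))
```

A check like this fits naturally into a build or CI step, so a dropped placeholder fails fast instead of surfacing as a broken string at runtime.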

Developer tooling around API specs, structured outputs, and engineering workflow automation.

This workstream is for tools that help developers move faster around specs, schemas, generated code, API documentation, or related automation.

Examples of work in this area:

  • API/spec utilities
  • Schema and contract tooling
  • Developer workflow automation
  • Lightweight AI-assisted coding tools
  • Experiments around turning specs into usable project assets

Current focus

I am building public portfolio projects that show, in the open, the kind of work I do for clients but cannot share in detail.

My emphasis is on:

  • Messy real-world data
  • Transparent assumptions
  • Reproducible pipelines
  • Honest validation
  • Practical analyst workflows
  • LLMs used as workflow support, not magic black boxes
  • Small tools that solve specific developer pain points

Selected stack

Python · SQL · DuckDB · pandas · scikit-learn · LightGBM · Streamlit · AWS · LLM APIs · LangChain · LangGraph · JavaScript/TypeScript · GitHub Actions

What I’m interested in

  • Fraud detection and program integrity
  • Public-sector and government data systems
  • Applied ML that survives honest evaluation
  • LLMs for retrieval, triage, entity resolution, and analyst workflows
  • Reproducible data pipelines
  • Developer tools and workflow automation
  • API/spec tooling
  • Lightweight SaaS products

Popular repositories

  1. shipi18n-demo (public)

    Forked from Shipi18n/shipi18n-demo

    Fork this repo to see autonomous localization in 2 minutes

    HTML

  2. tsmend (public archive)

    LLM-driven TypeScript error repair. Layer 2-4 companion to @shipispec/tsfix.

    TypeScript

  3. relief-probe (public)

    Finding fraud leads in 11.4M public PPP loans, validated against real DOJ prosecutions, and honest about what works and what doesn't.

    Python

  4. owgreen-dev (public)