Skip to content

MLM2/astra-ai-oversight-demo

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AI-Enabled Infrastructure Intelligence — Governance & Oversight Demo (ASTRA-aligned)

A SETA-style demonstration repo showing how to design governance, oversight, and analyst-in-the-loop validation around AI-enabled geospatial + text analytics using open-source data only.

This is intentionally not a “maximize accuracy” ML project. The value is the governance layer: confidence gating, review checkpoints, explainability notes, a risk register, and audit logging.

What it does (end-to-end)

  1. Acquire open data for an Area of Interest (AOI)
    • OpenStreetMap (OSM) features for infrastructure and (optionally) publicly tagged military landuse
    • Open building footprints (e.g., Microsoft Global ML Building Footprints)
  2. Score “facility candidates” using simple, explainable heuristics
    • building density / footprint area
    • proximity to roads / runways / ports (if present)
  3. Extract entities & relationships from open reporting text (demo corpus)
    • organizations, units, locations, dates
  4. Link evidence in a lightweight graph
    • facilities ↔ organizations/units ↔ reporting snippets
  5. Apply governance gates
    • confidence thresholds
    • mandatory human review when risk triggers fire
    • model/data documentation
    • audit logs of decisions and overrides

Safety note

This repo is designed for governance demonstration. It does not provide targeting guidance, operational analysis, or instructions to identify sensitive sites. Keep AOIs small and non-sensitive; stick to publicly available, non-operational use cases.

Repo map

  • src/astra_demo/data/ — data acquisition + normalization
  • src/astra_demo/models/ — simple, explainable scoring models
  • src/astra_demo/nlp/ — entity extraction utilities
  • src/astra_demo/graph/ — graph build/export
  • src/astra_demo/governance/ — thresholds, risk triggers, audit logging
  • docs/governance/ — model card, data sheet, risk register, review checklist
  • notebooks/ — step-by-step walkthrough

Quickstart (local)

python -m venv .venv
source .venv/bin/activate  # Windows: .venv\\Scripts\\activate
pip install -r requirements.txt

# 1) set AOI and pull OSM features (read-only)
python -m astra_demo.cli fetch-osm --aoi "Arlington, VA" --out data/raw/osm.geojson

# 2) run a simple facility scoring pass
python -m astra_demo.cli score-facilities --osm data/raw/osm.geojson --out data/processed/facility_scores.parquet

# 3) run NER on the included demo text corpus
python -m astra_demo.cli extract-entities --in data/raw/demo_corpus.jsonl --out data/processed/entities.jsonl

# 4) build an evidence graph
python -m astra_demo.cli build-graph --facilities data/processed/facility_scores.parquet --entities data/processed/entities.jsonl --out reports/evidence_graph.graphml

# 5) generate governance artifacts (risk register snapshot + audit log starter)
python -m astra_demo.cli governance-snapshot --out reports/governance_snapshot

Data & attribution (open-only)

Suggested open datasets and their licensing references:

  • OpenStreetMap data is available under the ODbL (Open Data Commons Open Database License).
  • Microsoft Global ML Building Footprints is released under ODbL.
  • Copernicus Sentinel data is available on a free, full, and open basis.
  • SpaceNet datasets provide open geospatial ML benchmarks (useful if you later want to swap in imagery-based baselines).

See docs/governance/DATA_SHEET.md for an attribution checklist.

Why this is ASTRA/SETA-relevant

  • Demonstrates AI assurance thinking (model risk, drift, false positives)
  • Shows how to integrate analyst validation instead of “black box automation”
  • Produces artifacts a government lead can review: risk register, model card, SOP-style checklists, auditable decisions

About

AI governance + analyst-in-the-loop oversight demo using open-source geo/text data

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages