Automap

Agentic Knowledge Graph Generation

Overview

Automap is an agentic pipeline that leverages Large Language Models (LLMs) and LangGraph to automate the creation of RML mappings and Knowledge Graph materialisation. The system uses a decentralised multi-agent architecture to analyse CSV schemas, scout ontologies, align schemas, generate Competency Questions (CQs), iteratively refine YARRRML mappings, and validate the final KG with both SPARQL and SHACL — all without manual intervention.

stateDiagram-v2
    [*] --> analyze_schema
    analyze_schema --> scout_ontology
    scout_ontology --> map_semantics
    map_semantics --> align_schema
    align_schema --> generate_cqs
    generate_cqs --> generate_yarrrml

    note right of generate_yarrrml
        Decentralised: PrefixAgent + EntityAgent (parallel)
        + RelationshipAgent (sequential)
        Prefix KV-cache reuse across retries
    end note

    generate_yarrrml --> validate_yarrrml

    state validate_yarrrml <<choice>>
    validate_yarrrml --> generate_yarrrml : Syntax Error (retries < RETRY_SYNTAX_MAX)
    validate_yarrrml --> [*] : Syntax Error (cap reached)
    validate_yarrrml --> refine_logic : No Syntax Error

    state refine_logic <<choice>>
    refine_logic --> generate_yarrrml : Logic Error (retries < RETRY_LOGIC_MAX)
    refine_logic --> [*] : Logic Error (cap reached)
    refine_logic --> generate_kg : No Logic Error

    generate_kg --> shacl_validate

    state shacl_validate <<choice>>
    shacl_validate --> generate_yarrrml : SHACL Violations (retries < RETRY_SHACL_MAX)
    shacl_validate --> sparql_validate_cqs : Conforms / --shacl not set / cap reached

    state sparql_validate_cqs <<choice>>
    sparql_validate_cqs --> generate_yarrrml : CQ Failures (retries < RETRY_CQ_MAX)
    sparql_validate_cqs --> align_schema : CQ Failures (deep retry)
    sparql_validate_cqs --> [*] : All CQs Pass / cap reached

Key Features

Decentralised YARRRML Generation: YARRRML generation is split across three specialised agents — PrefixAgent, EntityAgent, and RelationshipAgent. The prefix and entity agents run in parallel, reducing overall generation time. Prefix declarations are reused via KV-caching so the prefix block is not regenerated on every retry.
Competency Question (CQ) Validation: Auto-generates CQs from the schema and ontology, or accepts user-provided CQs. CQs are translated to SPARQL ASK queries by an LLM agent and executed against the materialised KG using an in-process pyoxigraph store — no external SPARQL endpoint, username, or port configuration required.
SHACL Validation (Astrea + rdflib fallback): Optional --shacl flag generates ontology-derived shapes using a three-tier strategy: (1) Astrea REST API, (2) local rdflib-based shape generation from OWL class/property declarations, (3) structural correctness shapes as a last resort. Violations trigger a targeted re-generation loop.
Multi-Agent Orchestration: Specialised agents for schema analysis, ontology scouting, semantic mapping, schema alignment, CQ generation, YARRRML architecture, logic refinement, and SPARQL validation.
Self-Correction Loop: Automatic syntax validation and logical refinement with up to 10 syntax retries and 6 logic retries.
Schema Alignment: Detects multi-node vs. flat mapping structures; auto-injects missing columns and prevents disconnected mappings.
SPARQL Direct Validation: Pass raw ASK queries via --sparql for deterministic checks (no LLM translation).
Base URI Control: Override the subject namespace with --base-uri or the BASE_URI environment variable.
Multi-Level Evaluation: Post-run evaluation covering pipeline success metrics, gold-standard KG comparison (precision/recall/F1), column coverage, and CQ coverage.
Terminal-Native Observability: Real-time streaming of agent states, stage timings, and reasoning directly to the console.
Native Docker Support: Pre-configured environment with automated compatibility patches.

Pipeline Stages

Stage	Description
Schema Analysis	Extracts column names, sample values, and infers data types from the input CSV.
Ontology Scout	Parses the provided ontology and identifies relevant classes, object properties, and data properties.
Semantic Mapper	Maps CSV columns to ontology concepts using LLM reasoning.
Schema Alignment	Determines flat vs. multi-node structure; plans entity subjects and cross-references.
CQ Generator	Auto-generates Competency Questions or uses user-supplied ones; saves to `cqs.txt`.
YARRRML Generator	Decentralised generation: PrefixAgent and EntityAgent run in parallel, then RelationshipAgent adds cross-mapping links. Prefix KV-cache is reused across retries.
Syntax Validator	Validates YARRRML syntax via `yatter`; retries on failure.
Logic Refiner	Checks for structural issues (disconnected mappings, missing columns, wrong datatypes); retries on failure.
KG Generator	Materialises the KG from the final YARRRML using `morph-kgc`.
SHACL Validator	(Optional, `--shacl`) Generates ontology-derived shapes via Astrea → rdflib local → structural fallback; validates the KG with pyshacl; retries on violations.
SPARQL CQ Validator	Translates CQs to SPARQL `ASK` queries (LLM) and executes them against an in-process pyoxigraph store; retries on failures with structured feedback.

Observability & Debugging

While LangGraph is open-source, its primary visualisation tool, LangSmith, often presents limitations:

Tier Constraints: Free tiers have strict trace limits and data retention periods.
Privacy & Latency: Sending agent traces to a third-party cloud is not always feasible.
Complexity: Setup requires API keys and external dashboard management.

The "Terminal-First" Approach

To keep this project lightweight and independent, we use Native Terminal Streaming. The pipeline uses a custom event-loop in main.py to provide real-time feedback:

Live Stage Tracking: See exactly which node is active along with its elapsed time.
Stage Timing Table: End-of-run summary showing time and relative progress bar for each stage.
Logic Refinement Feedback: The Logic Refiner agent prints its specific structural critique directly to your terminal.
Syntax Validation: Instant PASS/FAIL status reports with error excerpts.
SHACL Results: Inline violation count, shapes source (Astrea / rdflib / structural), and retry routing decisions.
CQ Validation: Per-question PASS/FAIL breakdown with SPARQL retry status.

Installation & Setup

This project uses uv for fast Python dependency management and docker for containerised execution.

1. Local Environment Setup

Ensure you have uv installed.

# Sync dependencies
uv sync

# Activate the virtual environment
source .venv/bin/activate  

# Apply essential Morph-KGC compatibility patches
# NOTE: This script is currently optimised for Linux.
bash scripts/patch_morph_kgc.sh

# Set up your environment variables
cp .env.example .env  # Edit with your LLM API keys and file paths

2. Environment Variables

Copy .env.example to .env and fill in your values. All variables are optional unless marked required.

Provider & connection

Variable	Description	Default
`LLM_PROVIDER`	LLM backend — `lm_studio` or `comet`	`lm_studio`
`LM_STUDIO_URL`	Base URL of your LM Studio server	`http://localhost:1234/v1`
`LANGSMITH_API_KEY`	LangSmith tracing key (optional)	—

Model selection

Variable	Description	Default
`LLM_MODEL_DEFAULT`	Single knob — model used by all agents unless overridden	`qwen2.5-coder-14b-instruct`
`LLM_MODEL_SCHEMA`	Override for schema analysis agent	`LLM_MODEL_DEFAULT`
`LLM_MODEL_MAPPER`	Override for semantic mapper agent	`LLM_MODEL_DEFAULT`
`LLM_MODEL_ALIGNMENT`	Override for schema alignment agent	`LLM_MODEL_DEFAULT`
`LLM_MODEL_YARRRML`	Override for prefix / entity / relationship agents	`LLM_MODEL_DEFAULT`
`LLM_MODEL_CQ`	Override for CQ generator & SPARQL translator	`LLM_MODEL_DEFAULT`
`LLM_MODEL_REFINER`	Override for logic refiner	`LLM_MODEL_DEFAULT`

Tip: To run a hybrid model configuration (e.g. DeepSeek for schema reasoning + Qwen for YARRRML generation), simply uncomment and set the relevant LLM_MODEL_* lines in your .env — no code changes needed.

Temperature & timeout

Variable	Description	Default
`LLM_TEMP_SCHEMA`	Sampling temperature for schema agent	`0.3`
`LLM_TEMP_MAPPER`	Sampling temperature for mapper agent	`0.3`
`LLM_TEMP_ALIGNMENT`	Sampling temperature for alignment agent	`0.2`
`LLM_TEMP_YARRRML`	Sampling temperature for YARRRML agents	`0.3`
`LLM_TEMP_CQ`	Sampling temperature for CQ/SPARQL agent	`0.2`
`LLM_TEMP_REFINER`	Sampling temperature for refiner	`0.2`
`LLM_TIMEOUT`	Global request timeout in seconds (`0` = use per-role defaults)	`0`

Input / output

Variable	Description	Default
`INPUT_CSV_PATH`	Required. Path to the input CSV file	—
`INPUT_ONTOLOGY_PATH`	Required. Path to the input ontology (Turtle)	—
`BASE_URI`	Base namespace for all generated subject URIs	`http://mykg.org/resource/`

3. Execution

# Basic run
uv run python main.py

# Run with SHACL validation (ontology-derived shapes: Astrea → rdflib → structural)
uv run python main.py --shacl

# Custom subject URI namespace
uv run python main.py --base-uri http://mykg.org/resource/

# User-provided Competency Questions
uv run python main.py --cqs "Which films exist?" "Who directed each film?"

# CQs from a file (one per line)
uv run python main.py --cqs @my_cqs.txt

# Direct SPARQL ASK validation (no LLM translation)
uv run python main.py --sparql "ASK { ?s a <http://dbpedia.org/ontology/Film> }"

# Full evaluation (all levels) with gold KG comparison
uv run python main.py --eval 1 2 3 --gold data/gold/my_gold.nt


# Combined: SHACL + user CQs + evaluation
uv run python main.py --shacl --cqs @cqs.txt --eval 1 2 3

4. Running via Docker (Recommended)

# Default run (reads INPUT_CSV_PATH / INPUT_ONTOLOGY_PATH from .env)
docker-compose up --build

# Pass extra CLI flags via the docker-compose `command:` key in docker-compose.yml:
#   command: ["--shacl", "--sparql", "--eval", "1", "2", "3"]

# Or inline with docker run:
docker run --rm --env-file .env -v $(pwd)/data:/app/data llm-agents_rml --shacl

The Dockerfile uses an ENTRYPOINT so any arguments appended to docker run or set in docker-compose.yml under command: are forwarded directly to main.py. Python 3.12 / Pandas 2.0 / NumPy 2.0 compatibility patches for morph-kgc are applied automatically at build time.

Evaluation Levels

Run post-pipeline evaluation with --eval:

Level	Description
1	Pipeline success metrics: YARRRML produced, syntactically valid, translatable, KG materialised, retry count, triple count, latency.
2	Gold-standard KG comparison: normalised triple match (precision/recall/F1), schema-level predicate/class comparison, hallucinated vs. missing predicates.
3	Column coverage: YARRRML template references vs. literal value match in the first CSV row.
4	CQ/SPARQL validation coverage (always included when CQs are present).

# Level 1 only
uv run python main.py --eval 1

# All levels with a gold KG
uv run python main.py --eval 1 2 3 --gold data/gold/bikeshare_gold.nt

Metrics are saved as eval_metrics.json in the run directory.

SHACL Validation

When --shacl is passed, the pipeline generates SHACL shapes using a three-tier strategy (dataset-agnostic — no hard-coded rules):

Tier	Source	Shapes generated
1. Astrea	Remote REST API (`https://astrea.linkeddata.es`)	Full OWL-to-SHACL derivation
2. rdflib local	Ontology file parsed locally	NodeShapes per class; IRI constraints for object properties; Literal constraints for datatype properties
3. Structural fallback	Built-in	Subjects of `rdf:type` must be IRIs; objects of `rdf:type` must be IRIs

All tiers use inference="none" in pyshacl to prevent RDFS-inferred false positives (typed literals being flagged as IRI violations).

Decentralised YARRRML Generation

Previously, the entire YARRRML document was produced by a single monolithic agent. The architecture has been refactored into three specialised sub-agents:

Agent	Responsibility	Execution
PrefixAgent	Declares the `prefixes:` block	Parallel with EntityAgent
EntityAgent	Generates `mappings:` with data properties	Parallel with PrefixAgent
RelationshipAgent	Adds object-property PO entries (cross-mapping links)	Sequential, after assembly

Prefix KV-caching: On retry iterations, the prefix block is reused from the previous attempt (unless the entity plan introduces new namespaces), avoiding redundant token generation for an unchanged prefix set.

Base URI enforcement: When --base-uri is set, the coordinator rewrites both subject (s:) URI templates and IRI object templates in po: entries (e.g. dbo:Person/$(person_id)~iri → mykg:Person/$(person_id)~iri) so the generated KG's entity URIs consistently reflect the user's own namespace across all triples.

CQ Validation with SPARQL and Pyoxigraph

Why Competency Questions?

A CSV schema alone does not determine a unique KG — the same columns can be mapped dozens of valid ways. Automap resolves this under-determination through Competency Questions (CQs): natural-language questions that articulate what the KG must be able to answer. CQs serve a dual role:

Generative bias (pre-materialisation): CQs are injected directly into the EntityAgent prompt as hard constraints — "The mapping MUST produce a KG that can answer ALL of these Competency Questions" — steering the generator toward the entity types, predicates, and join structures the intended use-case actually requires, before any triple is materialised.
Semantic validation target (post-materialisation): CQs are translated to SPARQL ASK queries and executed against the real KG to verify that the bias was effective. Failures feed structured diagnostic context back to the YARRRML generator for targeted re-generation.

When the user does not provide CQs (via --cqs), the generate_cqs node auto-generates them from the CSV schema and ontology, ensuring every pipeline run is guided and validated semantically, not just syntactically.

Validation flow

CQs are generated automatically from the schema and ontology (or supplied by the user via --cqs).
Each CQ is translated to a SPARQL ASK query by a dedicated LLM agent, grounded with actual classes and predicates probed from the materialised KG.
Queries are executed against an in-process pyoxigraph store — no external SPARQL endpoint, no localhost port, no credentials required.
Failed CQs trigger structured feedback to the YARRRML generator, identifying which triple patterns are absent from the KG.

Post-Install Patches (Compatibility Note)

The upstream dependency morph-kgc requires specific patches to support Python 3.12, Pandas 2.0+, and Numpy 2.0+.

Important

Platform Support: The scripts/patch_morph_kgc.sh script is currently Linux-only.

macOS Users: You may need to install gnu-sed or manually adjust the sed -i commands in the script.
Windows Users: Please use the Docker installation or manually apply the changes listed below in your site-packages.

File	Issue	Fix
`mapping_partitioner.py`	`value_counts()` index access	`.value_counts()[0]` → `.value_counts().iloc[0]`
`utils.py`	`np.NaN` alias removal	`np.NaN` → `np.nan`

Repository Structure

automap/
├── main.py                          # Entry-point: CLI argument parsing, pipeline invocation,
│                                    #   real-time stage streaming, evaluation dispatch
│
├── agents/                          # All LLM agent logic
│   ├── schema_agent.py              #   Extracts column names, sample values, and inferred types from CSV
│   ├── mapper_agent.py              #   Maps CSV columns → ontology classes/properties (semantic mapper)
│   ├── schema_alignment_agent.py    #   Decides flat vs. multi-node structure; plans entity subjects
│   ├── ontology_entity_planner.py   #   Intermediate entity plan used by the alignment agent
│   ├── cq_generator_agent.py        #   Auto-generates Competency Questions from schema + ontology
│   ├── cq_to_sparql_agent.py        #   Translates CQs → SPARQL ASK queries (grounded on live KG probe)
│   ├── prefix_agent.py              #   Generates the YARRRML `prefixes:` block (runs in parallel)
│   ├── entity_agent.py              #   Generates the `mappings:` block with data properties (runs in parallel)
│   ├── relationship_agent.py        #   Adds object-property PO entries / cross-mapping joins (sequential)
│   ├── yarrrml_coordinator.py       #   Orchestrates Prefix + Entity (parallel) → Relationship (sequential)
│   └── refiner_agent.py             #   Structural logic checker: disconnected mappings, missing columns,
│                                    #   wrong datatypes, phantom mappings; auto-fixes where possible
│
├── graph/                           # LangGraph workflow definitions
│   ├── workflow.py                  #   StateGraph topology: node wiring + all routing functions
│   │                                #   (retry caps read from RETRY_* env vars via config/settings.py)
│   └── nodes.py                     #   All node implementations: schema, ontology, mapper, alignment,
│                                    #   CQ generation, YARRRML generation, syntax validation, logic
│                                    #   refinement, KG generation, SHACL validation, SPARQL CQ validation
│
├── config/                          # Shared configuration
│   ├── settings.py                  #   LLM factory (get_llm / get_llm_with_retry), per-role model &
│   │                                #   temperature resolution, retry limit constants (RETRY_*_MAX)
│   ├── structured_output.py         #   Pydantic schemas for structured LLM outputs (MappingsOutput, etc.)
│   ├── yarrrml_examples.py          #   Few-shot YARRRML examples injected into agent prompts
│   └── prefixes.py                  #   Curated ontology prefix registry (dbo, schema, foaf, …)
│
├── data/                            # Data layer
│   ├── checkpoints.py               #   AgentState TypedDict: all pipeline state fields
│   ├── input/                       #   Input datasets (CSV files + Turtle ontologies)
│   │   └── <dataset>/
│   │       ├── data.csv
│   │       └── ontology.ttl
│   ├── gold/                        #   Gold-standard KGs for Level 2 evaluation (N-Triples)
│   └── output/                      #   Timestamped run directories (auto-created)
│       └── run_YYYYMMDD_HHMMSS/
│           ├── final_mapping.yaml   #     Final accepted YARRRML mapping
│           ├── knowledge_graph.nt   #     Materialised KG (N-Triples)
│           ├── cqs.txt              #     Competency Questions used in this run
│           ├── sparql_validation.txt#     CQ validation report (human-readable)
│           ├── sparql_validation.json#    CQ validation report (machine-readable)
│           ├── shacl_shapes.ttl     #     SHACL shapes used (only when --shacl)
│           ├── shacl_report.txt     #     SHACL validation report (only when --shacl)
│           ├── eval_metrics.json    #     Evaluation metrics (only when --eval)
│           └── debug/               #     Per-attempt YARRRML snapshots (attempt_1.yaml, …)
│
├── evaluation/                      # Multi-level post-run evaluation framework
│   ├── metrics.py                   #   Computes L1–L4 metrics (triple match, F1, column coverage, CQ %)
│   ├── run_experiment.py            #   Batch experiment runner (multiple CSV inputs, aggregated results)
│   └── analyze_results.py           #   Aggregates and prints experiment result tables
│
├── validation_hofer-et-al/          # Reproducibility benchmark (Hofer et al. comparison)
│   ├── compare_my_pipeline.py       #   Side-by-side F1 comparison against the GPT-4 reference pipeline
│   ├── README.md           #   Step-by-step reproduction guide + expected output
│   └── target/                      #   Comparison output artefacts (comparison_results.json)
│
├── tools/                           # Developer utilities (not part of the core pipeline)
│   └── rml_tools.py                 #   Helpers for RML/YARRRML inspection and debugging
│
├── scripts/
│   └── patch_morph_kgc.sh           # Applies Python 3.12 / Pandas 2.0 / NumPy 2.0 compatibility
│                                    #   patches to the morph-kgc site-packages (Linux only)
│
├── Dockerfile                       # Multi-stage Docker image (Python 3.12, patches applied at build)
├── docker-compose.yml               # Compose file — mounts ./data, passes .env, forwards CLI args
├── pyproject.toml                   # Project metadata and uv/pip dependencies
├── uv.lock                          # Locked dependency manifest (uv)
├── langgraph.json                   # LangGraph Studio configuration
├── .env.example                     # Environment variable template (copy to .env)
└── .gitignore

Research & Citations

If you use this tool in an academic context, please cite:

Morph-KGC

Arenas-Guerrero, J., et al. (2024). An RML-FNML module for Python user-defined functions in Morph-KGC. SoftwareX.
Arenas-Guerrero, J., et al. (2024). Morph-KGC: Scalable knowledge graph materialisation with mapping partitions. Semantic Web.

Yatter

Iglesias-Molina, A., et al. (2023). Human-Friendly RDF Graph Construction: Which One Do You Chose?. ICWE.

Astrea

Cimmino, A., et al. (2020). Astrea: Automatic Generation of SHACL Shapes from Ontologies. ESWC.

Acknowledgments

Funding

This work has received funding from the PIONERA project (Enhancing interoperability in data spaces through artificial intelligence), a project funded in the context of the call for Technological Products and Services for Data Spaces of the Ministry for Digital Transformation and Public Administration within the framework of the PRTR funded by the European Union (NextGenerationEU).

Contributors

Naveen Varma KALIDINDI - naveen.kalidindi@upm.es

Universidad Politécnica de Madrid (UPM)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Automap

Overview

Key Features

Pipeline Stages

Observability & Debugging

The "Terminal-First" Approach

Installation & Setup

1. Local Environment Setup

2. Environment Variables

Provider & connection

Model selection

Temperature & timeout

Input / output

3. Execution

4. Running via Docker (Recommended)

Evaluation Levels

SHACL Validation

Decentralised YARRRML Generation

CQ Validation with SPARQL and Pyoxigraph

Why Competency Questions?

Validation flow

Post-Install Patches (Compatibility Note)

Repository Structure

Research & Citations

Acknowledgments

Funding

Contributors

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
agents		agents
config		config
data		data
evaluation		evaluation
graph		graph
scripts		scripts
tools		tools
validation_hofer-et-al		validation_hofer-et-al
.env		.env
.env.example		.env.example
.gitignore		.gitignore
.python-version		.python-version
Dockerfile		Dockerfile
README.md		README.md
docker-compose.yml		docker-compose.yml
langgraph.json		langgraph.json
main.py		main.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Automap

Overview

Key Features

Pipeline Stages

Observability & Debugging

The "Terminal-First" Approach

Installation & Setup

1. Local Environment Setup

2. Environment Variables

Provider & connection

Model selection

Temperature & timeout

Input / output

3. Execution

4. Running via Docker (Recommended)

Evaluation Levels

SHACL Validation

Decentralised YARRRML Generation

CQ Validation with SPARQL and Pyoxigraph

Why Competency Questions?

Validation flow

Post-Install Patches (Compatibility Note)

Repository Structure

Research & Citations

Acknowledgments

Funding

Contributors

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages