🦒 Girafon : ESRS gap lint for sustainability reports

ESRS gap lint for sustainability reports. It tells you what’s missing and shows the exact evidence.

Limitations Girafon is a diagnostic aid, not a compliance certificate. Use it for first‑pass review and validate with an auditor.

Upload an ESG PDF or HTML report and Girafon returns per‑disclosure status, cited quotes with page numbers, a weighted score, greenwashing signals, and a prioritised fix list. Omnibus mode is draft and intended for scenario analysis only.

Why this exists

ESRS compliance checks are slow, repetitive, and error‑prone. Girafon focuses on evidence‑first gap detection instead of trying to be a full compliance system. The tradeoff is intentional: less breadth, more traceability, faster review.

Framework‑aware means each check maps to a named ESRS disclosure. Evidence‑first means every finding is tied to a quote and page. Open source means you can audit what it does, and change it when regulations shift.

Installation

# 1. Clone
git clone https://github.com/monsieurr/girafon
cd girafon

# 2. Install dependencies
pip install -r requirements.txt

Usage (two ways)

You can use Girafon either:

From the terminal (CLI) for batch runs and automation
With a GUI (Streamlit) for demos and quick uploads

Choose your path

Use case	Best choice	Why
Batch runs, automation, CI	CLI (Terminal)	Fast, scriptable, easy to scale
Demos, non-technical users	GUI (Streamlit)	Upload + click, shareable

CLI : Terminal usage

Option A : Local LLM with Ollama (no API key needed)

# Install Ollama: https://ollama.com
ollama pull llama3.2      # or: mistral, gemma3, qwen2.5
ollama serve              # start the local server

cp .env.example .env
# In .env set:
#   LLM_PROVIDER=ollama
#   LLM_MODEL=llama3.2

python main.py --check               # verify connection
python main.py --pdf report.pdf      # run analysis

Option B : Cloud API (Gemini, Anthropic, OpenAI, Groq, Mistral)

cp .env.example .env
# In .env set your provider + API key, e.g.:
#   LLM_PROVIDER=anthropic
#   ANTHROPIC_API_KEY=sk-ant-...

python main.py --check               # verify connection
python main.py --pdf report.pdf

This produces an HTML report like report_report_YYYYMMDD_HHMMSS.html : open it in any browser.

GUI : Streamlit usage

Option C : Local Streamlit UI

streamlit run streamlit_app.py

Reports generated from the UI are saved under ./outputs by default. You can override this with GIRAFON_OUTPUT_DIR=/path/to/folder.

Option D : Streamlit Cloud (self-hosted UI)

Push this repo to your GitHub.
On Streamlit Cloud, create a new app from your repo.
Add secrets in App settings → Secrets (see .streamlit/secrets.toml.example).
Deploy. The app stays private to your account/team.

Note: Streamlit Cloud / Render / Hugging Face Spaces cannot access your local Ollama. For public hosting, use a cloud LLM provider (OpenAI/Anthropic/Groq/Mistral) via API keys.

Option E : Docker (self-hosted UI)

docker build -t girafon .
docker run -p 8501:8501 girafon

Using Ollama running on your host

If you want the container to use Ollama on your host machine, set OLLAMA_HOST so Girafon can reach it.

Linux (host networking):

docker run --network=host -e OLLAMA_HOST=http://127.0.0.1:11434 girafon

macOS / Windows (Docker Desktop):

docker run -p 8501:8501 -e OLLAMA_HOST=http://host.docker.internal:11434 girafon

Deployment options (self-hosted)

Girafon is designed for self-deploy. Pick the path that matches your audience:

Local (fastest): pip install -r requirements.txt + streamlit run streamlit_app.py
Docker (reproducible): build and run the container (see Option E)
Streamlit Cloud / Render / HF Spaces: use a cloud LLM provider, not local Ollama

If you want your visitors to deploy it themselves, these three options cover 99% of cases.

Public demo mode (anonymized, no LLM required)

If you want a public demo without exposing real company names, you can generate an anonymized static showcase from existing HTML reports.

Generate reports as usual (single or batch).
Build the demo bundle (pass --summary if you want the comparison workspace):

python site/build_demo_bundle.py \
  --input-dir ./out \
  --summary ./out/summary.json \
  --output-dir site/demo

This creates:

site/demo/index.html
site/demo/company_01_report.html
site/demo/company_02_report.html
site/demo/comparison.html

You can now publish the site/ folder on GitHub Pages, Netlify, or any static host. Only publish site/demo (do not publish outputs/, which still contains real company names).

Modes and profiles (what they mean)

ESRS mode

Original : ESRS Set 1 as adopted in 2023 (current official baseline).
Omnibus / Simplified (draft) : proposed simplifications; use for scenario analysis (not yet adopted law).

Note: Omnibus/Simplified mode is based on EFRAG technical advice (Dec 2025) and is not yet adopted law. The currently applicable standard remains ESRS Set 1 (Delegated Act EU 2023/2772).

Schema profile

Basic : 20 disclosures for a first‑pass gap check.
IG3-core : Girafon preset: ESRS 2 + E1 + G1.
IG3 full : EFRAG Implementation Guidance datapoint list (non-authoritative).

CLI options

python main.py --pdf <path>           # Required: path to ESG PDF
python main.py --input-dir <folder>  # Batch mode: folder of PDFs
             --output-dir <folder>   # Batch mode: output folder (default: <input>/girafon_out)
               --company "Name"       # Company name for the report header
               --output report.html   # Custom output path
               --json results.json    # Also save raw results as JSON
               --mode original        # "original" (ESRS 2023) or "omnibus" (draft; not adopted law)
               --chunk-words 500      # Chunk size in words (default: 500)
               --overlap-words 120    # Chunk overlap (default: 120)
               --min-chunk-words 40   # Discard very short chunks (default: 40)
               --schema basic         # "basic" (20 disclosures), "ig3-core" (ESRS2+E1+G1), or "ig3" (full datapoints)
               --taxonomy-map map.json # Optional ESRS taxonomy mapping file
             --diff-base <pdf>        # Diff mode: baseline report
             --diff-new <pdf>         # Diff mode: comparison report
             --diff-output <html>     # Diff mode: output HTML for diff report

Batch analysis (cross-report screening)

Batch mode scans a folder of PDFs, generates one HTML report per file, plus:

summary.json (aggregated metrics)
comparison.html (portable screening workspace)

Example:

python main.py --input-dir ./reports --output-dir ./out

Output folder structure:

out/
  company_a_report.html
  company_b_report.html
  summary.json
  comparison.html

The comparison page is a screening tool, not a benchmark.

Note: Batch mode forces a local LLM (Ollama) with qwen2.5:14b to avoid cloud API rate limits. Make sure Ollama is running before you launch batch runs.

ESRS XBRL taxonomy mapping (optional)

If you want JSON outputs to include ESRS taxonomy element IDs, build the mapping once:

python -m esg_analyzer.taxonomy.build_map \
  --annex /path/to/Annex-1-ESRS-Set1-XBRL-Taxonomy-illustrated-in-Excel.xlsx \
  --out esg_analyzer/frameworks/esrs_taxonomy_map.json

When esrs_taxonomy_map.json exists, CLI and Streamlit outputs automatically add taxonomy_elements to each disclosure in the JSON report.

IG3 full datapoint schema (optional)

IG3 contains 1,000+ datapoints and is much slower than the basic 20-disclosure run. For a smaller run with high regulatory coverage, use --schema ig3-core (ESRS 2 + E1 + G1).

python -m esg_analyzer.frameworks.build_ig3_schema \
  --ig3 /path/to/EFRAG_IG3_List_of_ESRS_Data_Points.xlsx \
  --out esg_analyzer/frameworks/esrs_ig3_schema.json \
  --base-schema esg_analyzer/frameworks/esrs_schema.json

python main.py --pdf report.pdf --schema ig3-core
python main.py --pdf report.pdf --schema ig3

Materiality-aware scoring (strict)

In IG3 mode, the tool runs a strict materiality scan: it only treats a topic as non-material if the report explicitly states so. When a topic is declared non-material, related datapoints are excluded from scoring and recommendations, while still being listed in the report with a note. Materiality matrices in markdown tables are also parsed (strictly) when they include explicit "Not material" columns with marks (e.g. X).

Security & Data Privacy

Self-hosted by default. Run locally or on your own Streamlit Cloud workspace.
No data leaves your machine when using a local LLM (Ollama).
API keys are never stored in the UI. Use .env or Streamlit secrets.
Advisory only. Outputs are not legal compliance certificates.

ESRS framework modes

Mode	Description	Scope
`original`	Full ESRS Delegated Act (EU) 2023/2772	Wave 1 companies (FY2024 reports)
`omnibus`	Simplified ESRS (draft; not adopted law)	Indicative scope only (see Omnibus materials)

Omnibus/Simplified mode is based on EFRAG technical advice (Dec 2025) and is not yet adopted law. The applicable standard remains ESRS Set 1 (Delegated Act EU 2023/2772). Use --mode omnibus only for forward‑looking scenario analysis.

Covered ESRS disclosures

Section	Name	Category	Mandatory (Original)	Mandatory (Omnibus)
E1-6	Scope 1 GHG Emissions	Climate	✅	✅
E1-6	Scope 2 GHG Emissions	Climate	✅	✅
E1-6	Scope 3 GHG Emissions	Climate	✅	✅
E1-6	GHG Emissions Intensity	Climate	✅	⬜ voluntary
E1-4	Climate Targets	Climate	✅	✅
E1-5	Energy Consumption & Mix	Climate	✅	✅
E1-1	Transition Plan	Climate	✅	⬜ voluntary
E2-4	Pollution to Air/Water/Soil	Pollution	materiality	materiality
E3-4	Water Consumption	Water	materiality	materiality
E4-1	Biodiversity Policies	Biodiversity	materiality	⬜ voluntary
E5-5	Waste & Resource Use	Circular Economy	materiality	materiality
S1-9	Gender Diversity	Workforce	✅	✅
S1-14	Health & Safety (LTIFR)	Workforce	✅	✅
S1-13	Training Hours	Workforce	✅	⬜ voluntary
S1-8	Collective Bargaining	Workforce	✅	✅
S2-1	Supply Chain Due Diligence	Value Chain	materiality	materiality
G1-1	Governance Structure	Governance	✅	✅
G1-3	Anti-Corruption Policy	Governance	✅	✅
G1-4	Corruption Incidents	Governance	✅	✅
G1-5	Political Lobbying	Governance	materiality	⬜ voluntary

Mandatory note: “Mandatory” here means mandatory if the topic is material. Only ESRS 2 Appendix B datapoints (SFDR/Pillar 3/EU Taxonomy-linked) are mandatory regardless of materiality.

Architecture

girafon/
├── main.py                              # CLI entry point
├── esg_analyzer/
│   ├── parsers/
│   │   └── pdf_parser.py               # PDF → overlapping text chunks + page numbers
│   ├── frameworks/
│   │   └── esrs_schema.json            # ESRS disclosures (Omnibus-aware)
│   ├── retrieval/
│   │   └── search.py                   # 3-tier retrieval (embeddings → TF-IDF → keyword)
│   ├── analysis/
│   │   ├── detector.py                 # LLM-powered disclosure detection
│   │   └── scorer.py                   # Deterministic weighted scoring
│   └── report/
│       └── generator.py               # Self-contained HTML report

Design principles:

Framework logic is pure data (JSON), never hardcoded
LLM is used only to evaluate evidence : scoring is deterministic
No vector DB required for MVP (embeddings/TF‑IDF/keyword retrieval in‑memory)
Modular: swap any component without touching others

Report UX Enhancements (Workspace Mode)

The HTML report now includes lightweight, client-side workspace features while preserving the original structure and printability:

Sticky navigation with anchors to major sections
Filters (missing / partial / found, mandatory, high priority)
Search across IDs, titles, missing datapoints, and quotes
Confidence badges (high / medium / low) based on evidence presence
Review workflow (to review / validated / dismissed) stored in localStorage
Export reviewed findings to CSV/JSON
Clearer evidence blocks (quote + rationale + source page)

These changes are purely presentational and do not alter the core data model or analysis logic. The report remains a standalone HTML file.

How scoring works

Each ESRS disclosure has a weight (1–10). The overall score is:

score = Σ(weight_i × status_value_i) / Σ(weight_i) × 100

where status_value: FOUND=1.0, PARTIAL=0.5, MISSING=0.0

Weights are configurable in esrs_schema.json.

Score bands:

Score	Band
80–100	✅ Excellent
60–79	🟡 Good
40–59	🟠 Needs Improvement
0–39	🔴 Weak / High Risk

Greenwashing detection

The tool flags vague or unsupported language patterns including:

"we aim to be net zero" without a baseline, timeline, or methodology
Net-zero claims without interim milestones
Heavy offset reliance without disclosure of offset type/standard
Scope 3 total disclosed but categories not enumerated
Targets without a baseline year

Limitations

This tool is advisory only : it is not a legal compliance certificate
Accuracy depends on PDF text extraction quality (scanned PDFs may need OCR)
LLM evaluation can produce false positives/negatives; always human-review the output
ESRS rules are subject to ongoing regulatory change; schema will be updated

Roadmap

Benchmarking vs industry peers
Year-over-year trend tracking
Semantic search upgrade (sentence-transformers + FAISS)
GRI / ISSB cross-mapping
Batch processing for multiple companies
Streamlit web UI (basic)

Contributing

PRs welcome, especially:

Additional ESRS disclosures in esrs_schema.json
Better keyword lists for existing disclosures
Sample anonymised test reports

License

MIT : free to use, modify, and distribute.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
.streamlit		.streamlit
benchmarks		benchmarks
esg_analyzer		esg_analyzer
outputs/batch_run		outputs/batch_run
public/screenshots		public/screenshots
site		site
.env.example		.env.example
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt
streamlit_app.py		streamlit_app.py

Folders and files

Latest commit

History

Repository files navigation

🦒 Girafon : ESRS gap lint for sustainability reports

Why this exists

Installation

Usage (two ways)

CLI : Terminal usage

Option A : Local LLM with Ollama (no API key needed)

Option B : Cloud API (Gemini, Anthropic, OpenAI, Groq, Mistral)

GUI : Streamlit usage

Option C : Local Streamlit UI

Option D : Streamlit Cloud (self-hosted UI)

Option E : Docker (self-hosted UI)

Using Ollama running on your host

Deployment options (self-hosted)

Public demo mode (anonymized, no LLM required)

Modes and profiles (what they mean)

CLI options

Batch analysis (cross-report screening)

ESRS XBRL taxonomy mapping (optional)

IG3 full datapoint schema (optional)

Materiality-aware scoring (strict)

Security & Data Privacy

ESRS framework modes

Covered ESRS disclosures

Architecture

Report UX Enhancements (Workspace Mode)

How scoring works

Greenwashing detection

Limitations

Roadmap

Contributing

License

References

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages