AI Red Team Risk Register

AI-assisted security analysis tool that generates architecture-grounded attack scenarios and supports human review and risk decisions.

This project explores how AI can accelerate security analysis while keeping critical risk decisions under human control.

Given a system architecture description, the agent:

extracts the risk surface
generates a risk register of attack scenarios
models risk progression
drafts defense playbooks
routes scenarios into a human Review & Decide workflow

The result is a structured risk register that security teams can review, validate, and act on.

Demo Flow

The intended demo flow:

Select an architecture preset or paste an architecture description.
Click Generate Risk Register.
The AI agent:
- extracts assets, trust boundaries, entry points, and controls
- generates 6 architecture-grounded risk scenarios
Each scenario includes:
- MITRE ATT&CK mapping
- likelihood and impact
- risk score
- confidence estimate
- attack chain
- defense playbook
Scenarios move into the Review & Decide queue where a human analyst decides whether to:
- validate the scenario
- reject it
- request further investigation

This creates a clear separation between AI analysis and human risk ownership.

Why Human Review Matters

Security risk decisions cannot be fully automated.

The system intentionally stops before the final decision.
A human analyst must review each scenario and determine whether it should enter the validated risk register.

Reasons include:

AI can misinterpret architecture context
risk tolerance is a business decision
security teams must control prioritization and ownership

The tool accelerates analysis, but humans remain accountable for risk decisions.

System Architecture

The application is structured as a lightweight AI agent pipeline.

Flow:

Architecture Description
→ Risk Surface Extraction
→ Scenario Generation
→ Attack Chain Modeling
→ Defense Playbook Drafting
→ Review & Decide (Human)

Key design goals:

transparent reasoning
schema-validated AI outputs
safe security analysis (no exploit instructions)
human-in-the-loop governance

Example Scenario Output

Each generated scenario includes structured security metadata.

Example fields:

title
severity
MITRE tactic and technique
attack vector
attack chain
business impact
evidence from architecture
assumptions
confidence score
likelihood
impact
risk score
recommended defense playbook

This format mirrors a typical security risk register used in real organizations.

Key Features

AI-assisted risk register generation
architecture-grounded scenario analysis
MITRE ATT&CK mapping
structured risk scoring
confidence estimates for each scenario
defense playbook suggestions
human review and governance workflow
schema-validated LLM outputs

Tech Stack

Frontend

Next.js
React
TypeScript
TailwindCSS

Backend / Agent

Next.js API Routes
Google Gemini API
Zod schema validation

Other

Server-Sent Events for agent streaming
structured scenario normalization
PDF export for red team reports

Safety Guardrails

The AI agent is intentionally restricted to risk analysis only.

The system will not generate:

exploit payloads
step-by-step attack instructions
operational hacking guidance

Instead it focuses on:

architectural risk conditions
detection signals
mitigation strategies

Example Use Cases

Security architecture review
Red team preparation
Risk register creation
Threat modeling acceleration
Security design reviews

What Would Break at Scale

The first scaling challenge would likely be LLM latency and cost when generating scenarios for large architectures.

Potential solutions include:

architecture chunking
risk surface caching
model batching
asynchronous generation pipelines

Another challenge would be review workflow management once hundreds of scenarios accumulate, requiring queue prioritization and ownership tracking.

Running Locally

Install dependencies:

npm install
npm run dev

Add your API key:

GEMINI_API_KEY=your_key_here

Future Improvements

improved architecture parsing
multi-agent threat modeling
scenario deduplication
risk prioritization models
collaborative review workflows
SOC and SIEM integration

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
src		src
.env.local.example		.env.local.example
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
flow.png		flow.png
next.config.ts		next.config.ts
package-lock.json		package-lock.json
package.json		package.json
postcss.config.mjs		postcss.config.mjs
tailwind.config.ts		tailwind.config.ts
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI Red Team Risk Register

Demo Flow

Why Human Review Matters

System Architecture

Example Scenario Output

Key Features

Tech Stack

Frontend

Backend / Agent

Other

Safety Guardrails

Example Use Cases

What Would Break at Scale

Running Locally

Future Improvements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AI Red Team Risk Register

Demo Flow

Why Human Review Matters

System Architecture

Example Scenario Output

Key Features

Tech Stack

Frontend

Backend / Agent

Other

Safety Guardrails

Example Use Cases

What Would Break at Scale

Running Locally

Future Improvements

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages