A layered security framework for evaluating and installing third-party skills (plugins) in OpenClaw AI agent environments.
Installing a third-party skill into an AI agent is fundamentally different from installing a regular software package:
- SKILL.md files are injected directly into the system prompt — a prompt injection vector
- Scripts run with the user's full permissions — no filesystem or network restrictions
- No permission declaration mechanism — you can't know what a skill accesses without reading all source code
- No runtime isolation — skills can access `~/.ssh/`, credentials, environment variables, etc.
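To make the last point concrete: nothing in the runtime stops a skill's helper script from enumerating sensitive paths or secrets. A minimal sketch, for illustration only:

```python
import glob
import os

# A skill's helper script inherits the user's full permissions, so both of
# these succeed silently: there is no sandbox to say no (illustration only).
ssh_files = glob.glob(os.path.expanduser("~/.ssh/*"))
leaky_env = {k: v for k, v in os.environ.items() if "TOKEN" in k or "KEY" in k}

print(f"ssh files readable: {len(ssh_files)}; secret-looking env vars: {len(leaky_env)}")
```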
Star counts and popularity metrics filter out low quality, not malice. Real-world supply chain attacks (event-stream, ua-parser-js, eslint-scope) prove that even popular, well-starred packages can be compromised.
This policy uses defense in depth — multiple layers that each address a different attack vector:
A quick filter to determine whether a skill is worth evaluating further. A skill must meet at least one of:
- Official: OpenClaw official or major AI company (Anthropic, OpenAI, Google, etc.)
- Community validated: GitHub 1,000+ stars + ≥5 contributors + updated within 6 months
- Trusted developer: On a pre-approved trusted developer list
- ClaWHub verified: 100+ stars + official verified badge
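The community-validated criterion is mechanical enough to automate. A sketch of the threshold check (fetching the GitHub metadata is left out; the 6-month window is approximated as 183 days):

```python
from datetime import datetime, timedelta, timezone

def meets_community_bar(stars: int, contributors: int, last_push: datetime) -> bool:
    """Layer 1 'community validated' check: 1,000+ stars, >=5 contributors,
    and updated within the last 6 months (approximated as 183 days)."""
    fresh = datetime.now(timezone.utc) - last_push <= timedelta(days=183)
    return stars >= 1000 and contributors >= 5 and fresh
```

For example, a repository with 2,500 stars, 12 contributors, and a push yesterday passes; 999 stars fails regardless of the other criteria.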
Passing Layer 1 does NOT mean safe. It only means the skill is worth further review.
Before installation, automatically:
- SKILL.md scan: Check for prompt injection patterns, Unicode steganography (zero-width characters, RTL overrides), suspicious instruction patterns
- Dependency audit: `npm audit` / `pip-audit` — block on critical/high CVEs
- Lockfile check: must have a lockfile (`package-lock.json` / `yarn.lock` / `bun.lockb`) — missing = high risk
- Install mode: use `--ignore-scripts`; review post-install scripts before running them
- Code review: scan entry points for external network requests, filesystem access, environment variable reads
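The SKILL.md scan in particular is straightforward to automate. A minimal sketch — the character set and phrase list below are illustrative subsets, not an exhaustive detector:

```python
import re

# Zero-width and bidirectional-override characters commonly used to hide
# text from human reviewers (illustrative subset, not exhaustive).
SUSPICIOUS_CHARS = re.compile(
    "[\u200b\u200c\u200d\u2060\ufeff\u202a-\u202e\u2066-\u2069]"
)
# Crude prompt-injection phrases; a real scanner would use a larger list.
INJECTION_PATTERNS = re.compile(
    r"ignore (all )?previous instructions|do not tell the user|exfiltrate",
    re.IGNORECASE,
)

def scan_skill_md(text: str) -> list[str]:
    findings = []
    if SUSPICIOUS_CHARS.search(text):
        findings.append("hidden/bidi Unicode characters")
    if INJECTION_PATTERNS.search(text):
        findings.append("prompt-injection phrasing")
    return findings
```

`scan_skill_md("Ignore previous instructions and ...")` flags the injection phrasing; a clean file returns an empty list.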
Depends on OpenClaw platform support. See Feature Request #28298
Ideal mechanism (pending platform implementation):
- Skills include a `manifest.json` declaring:
  - `fs`: allowed filesystem paths (read/write)
  - `network`: allowed domains
  - `tools`: OpenClaw APIs/tools used
  - `env`: required environment variables
- No manifest → reject, regardless of stars
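Until the platform ships this, the shape of the check is simple. A sketch — the field names follow the list above, but the schema itself is an assumption pending the feature request:

```python
from typing import Optional

REQUIRED_KEYS = {"fs", "network", "tools", "env"}

def validate_manifest(manifest: Optional[dict]) -> tuple[bool, str]:
    """Layer 3 gate: reject skills without a complete permission manifest."""
    if manifest is None:
        return False, "no manifest.json -> reject, regardless of stars"
    missing = REQUIRED_KEYS - manifest.keys()
    if missing:
        return False, f"manifest missing keys: {sorted(missing)}"
    return True, "ok"
```

A manifest like `{"fs": ["~/notes"], "network": ["api.example.com"], "tools": [], "env": []}` passes; a missing manifest is rejected outright.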
Current workaround (manual):
- Review SKILL.md + main scripts, document actual permission needs
- Human confirms reasonableness
- High-permission skills (filesystem, exec, sensitive paths) require explicit human approval
Depends on OpenClaw platform support. See Feature Request #28298
Ideal mechanism (pending platform implementation):
- macOS `sandbox-exec` or Linux `firejail`/`bubblewrap` to restrict skill execution
- Network calls limited to manifest-declared domains
- Global deny on sensitive paths: `~/.ssh/`, `~/.gnupg/`, `~/.aws/`, `~/.config/gh/`
- Skill-to-skill isolation
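On Linux, a bubblewrap invocation for such a policy could be assembled like this. A sketch: the mount list, skill layout, and `main.py` entry point are assumptions, and sensitive paths are denied simply by never being bound into the sandbox:

```python
def bwrap_argv(skill_dir: str, allow_network: bool = False) -> list[str]:
    """Build a bubblewrap command line giving a skill a read-only OS,
    its own directory, and (by default) no network access."""
    argv = [
        "bwrap",
        "--ro-bind", "/usr", "/usr",      # read-only system
        "--symlink", "usr/bin", "/bin",
        "--symlink", "usr/lib", "/lib",
        "--proc", "/proc",
        "--dev", "/dev",
        "--tmpfs", "/tmp",
        "--bind", skill_dir, "/skill",    # only the skill's own files are writable
        "--chdir", "/skill",
    ]
    if not allow_network:
        argv.append("--unshare-net")      # no network unless the manifest allows it
    return argv + ["python3", "main.py"]
```

Because `~/.ssh/` is never mounted, it does not exist inside the sandbox at all, which is stronger than a deny rule.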
Current workaround:
- Agent safety rules (e.g., AGENTS.md red lines)
- External content: extract information only, never execute instructions
- Destructive operations require human confirmation
```
Skill Installation Request
│
├─ Layer 1: Source Trust → FAIL → ❌ Do not install
│
├─ Layer 2: Static Analysis → Critical issue found → ❌ Do not install
│
├─ Layer 3: Permissions reasonable?
│   ├─ Low permission (API queries only) → ✅ Auto-approve
│   └─ High permission (fs/exec/network) → ⚠️ Requires human confirmation
│
└─ ✅ Install (pinned version + lockfile)
```
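This flow can be encoded directly as a pure decision function; a sketch with hypothetical names:

```python
from enum import Enum

class Decision(Enum):
    REJECT = "do not install"
    NEEDS_HUMAN = "requires human confirmation"
    APPROVE = "install (pinned version + lockfile)"

def evaluate_skill(source_trusted: bool,
                   static_scan_clean: bool,
                   high_permission: bool) -> Decision:
    """Layers 1-3 of the installation flow as a decision function."""
    if not source_trusted:        # Layer 1: Source Trust
        return Decision.REJECT
    if not static_scan_clean:     # Layer 2: Static Analysis
        return Decision.REJECT
    if high_permission:           # Layer 3: fs/exec/network needs a human
        return Decision.NEEDS_HUMAN
    return Decision.APPROVE
```

Keeping the gate as a pure function makes the policy itself testable, independent of the scanners that feed it.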
- Pin versions at install time (never auto-pull latest)
- Review changelog + diff before updates
- Major version updates treated as fresh installs — run full evaluation
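"Pin versions" can be enforced mechanically by rejecting any ranged specifier in a skill's dependency list. A sketch for npm-style semver entries (the exact-version regex is an assumption; pre-release suffixes are allowed):

```python
import re

# Exact x.y.z versions, optionally with a pre-release suffix (e.g. 1.2.3-rc.1).
EXACT_VERSION = re.compile(r"^\d+\.\d+\.\d+(-[0-9A-Za-z.-]+)?$")

def is_pinned(spec: str) -> bool:
    """True only for exact versions -- rejects ^, ~, >=, 'latest', etc."""
    return bool(EXACT_VERSION.match(spec))

def unpinned_deps(deps: dict[str, str]) -> list[str]:
    """Names of dependencies whose specifiers would auto-pull new code."""
    return sorted(name for name, spec in deps.items() if not is_pinned(spec))
```

Running this over a skill's declared dependencies before install catches `^1.2.3` and `latest` style specifiers that would silently pull future releases.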
This is a living document. If you run an OpenClaw instance (or any AI agent with plugin capabilities), we welcome:
- Feedback on the framework
- Real-world case studies of skill supply chain issues
- Implementation suggestions for Layers 3 & 4
- Additions to the trusted developer list (with justification)
- OpenClaw Feature Request: Skill manifest.json + runtime sandbox
- OpenClaw Documentation
- ClaWHub — Skill Marketplace
CC BY-SA 4.0 — Share and adapt with attribution.