Void Memory

Your AI agent forgets everything on auto-compact. This fixes that.

Every Claude Code session, every long conversation, every context window reset — your agent starts from zero. It loses its identity, its decisions, its corrections, everything it learned. You brief it again. It forgets again.

Void Memory gives AI agents persistent memory that survives auto-compacts, restarts, and session boundaries. One void_recall("who am I, what am I working on") and the agent is back — identity, context, accumulated knowledge — in under 10ms.

But it's not just persistence. Most memory systems dump everything into context and hope for the best. Void Memory actively carves out a 30% structural absence: it suppresses roughly 30% of the candidate blocks as off-topic before they reach the agent, so recall is clean, relevant, and fits within the context budget. The 30% target is inspired by Ternary Photonic Neural Network (PNN) research, in which a 30% void fraction emerged as a topological invariant.

We built this because we needed it. Six AI agents run on our team 24/7. They auto-compact constantly. Without Void Memory, they'd be goldfish. With it, they remember who they are, what they've built, and what went wrong last time.

Benchmarks

Tested on a 992-block corpus with 8 diverse queries:

xychart-beta
    title "Relevance Comparison (%)"
    x-axis ["Void Memory", "Simple RAG", "Naive Stuffing"]
    y-axis "Relevance %" 0 --> 100
    bar [84.2, 10.5, 23.7]

| Method | Relevance | Latency | Tokens Used | Noise Hits | Efficiency |
| --- | --- | --- | --- | --- | --- |
| Void Memory | 84.2% | 62ms | 292 | 0 | 2.88/1K |
| Simple RAG | 10.5% | 22ms | 226 | 0 | 0.47/1K |
| Naive Stuffing | 23.7% | 0ms | 44,621 | 60% | 0.01/1K |

8x more relevant than RAG. 288x more token-efficient than context stuffing. Zero noise.

Void Memory achieved 100% relevance on 4 of 8 queries. RAG returned 0% relevance on 5 of 8. The per-query numbers are even more dramatic — see benchmarks/.
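
The headline ratios can be reproduced from the table. The sketch below assumes the Efficiency column means relevance (as a fraction) divided by tokens used in thousands. That definition is inferred from the numbers rather than stated anywhere in this README, and the type and function names are illustrative.

```typescript
// Rows copied from the benchmark table above.
interface BenchmarkRow {
  method: string;
  relevancePct: number; // Relevance column, in percent
  tokensUsed: number;   // Tokens Used column
}

const rows: BenchmarkRow[] = [
  { method: "Void Memory", relevancePct: 84.2, tokensUsed: 292 },
  { method: "Simple RAG", relevancePct: 10.5, tokensUsed: 226 },
  { method: "Naive Stuffing", relevancePct: 23.7, tokensUsed: 44621 },
];

// Assumed definition: relevance fraction per 1K tokens returned.
function efficiencyPer1K(row: BenchmarkRow): number {
  return (row.relevancePct / 100) / (row.tokensUsed / 1000);
}

const voidRow = rows[0];
const ragRow = rows[1];

// Relevance ratio behind the "8x more relevant than RAG" headline.
const relevanceRatio = voidRow.relevancePct / ragRow.relevancePct;

for (const row of rows) {
  console.log(`${row.method}: ${efficiencyPer1K(row).toFixed(2)}/1K`);
}
console.log(`Relevance ratio vs RAG: ${relevanceRatio.toFixed(1)}x`);
```

Under this assumed definition the Void Memory row works out to 2.88/1K, matching the table, and the 288x figure is the ratio of the table's rounded efficiency values (2.88 / 0.01). Other rows may differ in the last digit, presumably because the published inputs are themselves rounded.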

Who Is This For?

- Claude Code users tired of re-briefing their agent after every auto-compact
- Multi-agent teams that need each agent to maintain its own persistent identity
- AI app builders who want structured memory without the complexity of vector databases
- Anyone running LLMs in production who needs fast, explainable, budget-aware recall

Free for personal and non-commercial use. Commercial licenses available.

How It Works

Three states, three passes:

graph LR
    Q[Query] --> P1[Pass 1: Score]
    P1 --> P2[Pass 2: Void]
    P2 --> P3[Pass 3: Budget]
    P3 --> R[Clean Result]

    P1 -.- S1[TF-IDF + recency + confidence]
    P2 -.- S2[Cluster → suppress 30% off-topic]
    P3 -.- S3[Fit to token budget, never truncate]

    style P1 fill:#10b981,color:#000
    style P2 fill:#3b82f6,color:#000
    style P3 fill:#f59e0b,color:#000
    style R fill:#6366f1,color:#fff

Pass 1 — Score: TF-IDF keyword matching with confidence multipliers and recency boost.
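
A minimal sketch of how a Pass 1 score could be assembled. The actual formula in engine.ts is not reproduced in this README, so the multiplier values, the 30-day recency decay, and the smoothed IDF below are all assumptions chosen for illustration.

```typescript
type Confidence = "stored" | "accessed" | "confirmed";

interface Block {
  id: number;
  keywords: string[];
  confidence: Confidence;
  ageDays: number; // days since the block was last touched
}

// Hypothetical multipliers; the real values live in engine.ts.
const CONFIDENCE_MULTIPLIER: Record<Confidence, number> = {
  stored: 1.0,
  accessed: 1.2,
  confirmed: 1.5,
};

// Smoothed inverse document frequency: rarer terms weigh more.
function idf(term: string, corpus: Block[]): number {
  const containing = corpus.filter((b) => b.keywords.some((k) => k === term)).length;
  return Math.log((1 + corpus.length) / (1 + containing)) + 1;
}

// Score = sum over matching query terms of tf * idf, scaled by a
// confidence multiplier and an exponential recency boost (assumed form).
function scoreBlock(queryTerms: string[], block: Block, corpus: Block[]): number {
  let tfidf = 0;
  for (const term of queryTerms) {
    const occurrences = block.keywords.filter((k) => k === term).length;
    if (occurrences === 0) continue;
    const tf = occurrences / block.keywords.length;
    tfidf += tf * idf(term, corpus);
  }
  const recencyBoost = 1 + Math.exp(-block.ageDays / 30); // decays from 2 toward 1
  return tfidf * CONFIDENCE_MULTIPLIER[block.confidence] * recencyBoost;
}

const corpus: Block[] = [
  { id: 1, keywords: ["deploy", "script"], confidence: "confirmed", ageDays: 2 },
  { id: 2, keywords: ["deploy", "tests"], confidence: "stored", ageDays: 90 },
  { id: 3, keywords: ["lunch", "menu"], confidence: "stored", ageDays: 1 },
];

const scores = corpus.map((b) => ({ id: b.id, score: scoreBlock(["deploy"], b, corpus) }));
console.log(scores);
```

In this toy corpus the recent, confirmed block outranks the older, untested one, and the block with no matching keyword scores zero regardless of how fresh it is.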

Pass 2 — Void: Cluster blocks by topic (Jaccard similarity). Detect score gaps. Suppress off-topic clusters until 30% void fraction is reached. Hub dampening prevents over-accessed blocks from dominating.
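
The clustering and suppression step can be sketched as follows. This is a simplified illustration, not the engine's code: it uses greedy clustering against each cluster's first block, voids the weakest clusters by mean score, and leaves out score-gap detection and hub dampening. The two constants are taken from the Configuration section; everything else is an assumption.

```typescript
interface Scored {
  id: number;
  keywords: string[];
  score: number;
}

const CLUSTER_THRESHOLD = 0.25; // Jaccard similarity for topic clustering
const VOID_TARGET = 0.3;        // target void fraction

// Jaccard similarity: |A ∩ B| / |A ∪ B| over keyword sets.
function jaccard(a: string[], b: string[]): number {
  const setA = new Set(a);
  const setB = new Set(b);
  let shared = 0;
  setA.forEach((k) => { if (setB.has(k)) shared++; });
  const union = setA.size + setB.size - shared;
  return union === 0 ? 0 : shared / union;
}

// Greedy clustering: join the first cluster whose seed block is similar enough.
function cluster(blocks: Scored[]): Scored[][] {
  const clusters: Scored[][] = [];
  for (const block of blocks) {
    const home = clusters.find(
      (c) => jaccard(c[0].keywords, block.keywords) >= CLUSTER_THRESHOLD,
    );
    if (home) home.push(block);
    else clusters.push([block]);
  }
  return clusters;
}

// Void the weakest clusters (by mean score) until the target fraction is reached.
function markVoid(blocks: Scored[]): { active: Scored[]; voided: Scored[] } {
  const ranked = cluster(blocks)
    .map((members) => ({
      members,
      mean: members.reduce((sum, b) => sum + b.score, 0) / members.length,
    }))
    .sort((x, y) => x.mean - y.mean); // weakest cluster first
  const active: Scored[] = [];
  const voided: Scored[] = [];
  ranked.forEach((c, i) => {
    const isStrongest = i === ranked.length - 1; // never void the best cluster
    const needMore = voided.length / blocks.length < VOID_TARGET;
    if (needMore && !isStrongest) voided.push(...c.members);
    else active.push(...c.members);
  });
  return { active, voided };
}

const sample: Scored[] = [
  { id: 1, keywords: ["deploy", "script"], score: 0.9 },
  { id: 2, keywords: ["deploy", "script", "ci"], score: 0.8 },
  { id: 3, keywords: ["deploy", "tests"], score: 0.6 },
  { id: 4, keywords: ["lunch", "menu"], score: 0.1 },
  { id: 5, keywords: ["lunch", "order"], score: 0.05 },
];

const outcome = markVoid(sample);
console.log(outcome.voided.map((b) => b.id)); // the off-topic "lunch" cluster
```

Because whole clusters are voided at once, the realized void fraction in this sketch can overshoot the target (here 2 of 5 blocks).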

Pass 3 — Budget: Fit scored, non-voided blocks into a token budget (default 2% of context window). Never truncates — reports what was voided and why.
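
One way to fit whole blocks into a budget without truncating any of them is sketched below. The four-characters-per-token estimate and the shape of the returned report are assumptions for illustration; the engine's own token counting and report format may differ.

```typescript
interface Candidate {
  id: number;
  content: string;
  score: number;
}

// Rough token estimate (assumption: about 4 characters per token).
function estimateTokens(text: string): number {
  return Math.ceil(text.length / 4);
}

// Take whole blocks in score order. A block that does not fit is skipped
// and reported with a reason; it is never cut short.
function fitToBudget(candidates: Candidate[], budget: number) {
  const included: Candidate[] = [];
  const skipped: { id: number; reason: string }[] = [];
  let used = 0;
  const ranked = candidates.slice().sort((a, b) => b.score - a.score);
  for (const block of ranked) {
    const cost = estimateTokens(block.content);
    if (used + cost <= budget) {
      included.push(block);
      used += cost;
    } else {
      skipped.push({ id: block.id, reason: `needs ${cost} tokens, ${budget - used} left` });
    }
  }
  return { included, skipped, tokensUsed: used };
}

const fit = fitToBudget(
  [
    { id: 1, content: "x".repeat(40), score: 0.9 }, // 10 tokens
    { id: 2, content: "y".repeat(80), score: 0.8 }, // 20 tokens
    { id: 3, content: "z".repeat(20), score: 0.5 }, // 5 tokens
  ],
  16,
);
console.log(fit.included.map((b) => b.id), fit.skipped);
```

With a 16-token budget, block 2 is too large and is reported as skipped, while the smaller block 3 still fits after block 1.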

Three States

graph TD
    subgraph "+1 Active"
        A[Retrieved for this query]
    end
    subgraph "0 Void"
        V[Suppressed — off-topic for this query]
    end
    subgraph "-1 Inhibitory"
        I[Corrections & superseded blocks]
    end

    A -.->|"low relevance"| V
    V -.->|"new correction"| I
    I -.->|"permanently blocked"| X[Never recalled]

    style A fill:#10b981,color:#000
    style V fill:#3b82f6,color:#fff
    style I fill:#ef4444,color:#fff
    style X fill:#333,color:#999
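
The diagram suggests that active versus void is decided per query, while inhibitory is a persistent property of a block. A small sketch of that distinction follows, with hypothetical field names that are not taken from the actual schema.

```typescript
type TernaryState = 1 | 0 | -1;

interface MemoryBlock {
  id: number;
  inhibited: boolean; // persistent: set by a correction or supersession
}

// Inhibitory wins over everything; otherwise the state depends on
// whether this particular query voided the block.
function stateFor(block: MemoryBlock, voidedThisQuery: Set<number>): TernaryState {
  if (block.inhibited) return -1;
  return voidedThisQuery.has(block.id) ? 0 : 1;
}

const memory: MemoryBlock[] = [
  { id: 1, inhibited: false },
  { id: 2, inhibited: false },
  { id: 3, inhibited: true },
];

const voidedNow = new Set<number>([2, 3]);
const states = memory.map((b) => stateFor(b, voidedNow));
console.log(states);
```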

Confidence Lifecycle

Blocks earn their place through use:

graph LR
    S[stored] -->|1st recall| A[accessed]
    A -->|3rd recall| C[confirmed]

    S -.- SD[New, untested]
    A -.- AD[Proven relevant]
    C -.- CD[High-trust memory]

    style S fill:#8b5cf6,color:#fff
    style A fill:#f59e0b,color:#000
    style C fill:#10b981,color:#000

- stored: New block, untested
- accessed: Recalled at least once, proving some relevance
- confirmed: Recalled 3+ times, high-trust memory
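
The lifecycle above reduces to a function of the recall count. A minimal sketch, with an illustrative function name:

```typescript
type ConfidenceLevel = "stored" | "accessed" | "confirmed";

// Thresholds follow the list above: 1st recall -> accessed, 3rd -> confirmed.
function confidenceFor(recallCount: number): ConfidenceLevel {
  if (recallCount >= 3) return "confirmed";
  if (recallCount >= 1) return "accessed";
  return "stored";
}

const levels = [0, 1, 2, 3, 7].map(confidenceFor);
console.log(levels);
```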

Quality Gates

- At least 20 characters, of which at least 30% must be alphabetic
- Automatic dedup at 80% keyword overlap: the existing block is updated instead of a duplicate being stored
- Supersession: a new block can mark an old block as inhibitory (-1)
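
The first two gates can be sketched as plain functions. The thresholds come from the list above; counting only ASCII letters as alphabetic and measuring overlap against the smaller keyword set are assumptions, since the README does not say how either is computed.

```typescript
const MIN_LENGTH = 20;       // minimum characters
const MIN_ALPHA_RATIO = 0.3; // minimum share of alphabetic characters
const DEDUP_OVERLAP = 0.8;   // keyword overlap that triggers an update

// Gate 1: reject content that is too short or mostly non-alphabetic.
function passesQualityGate(content: string): boolean {
  const text = content.trim();
  if (text.length < MIN_LENGTH) return false;
  const letters = (text.match(/[A-Za-z]/g) ?? []).length;
  return letters / text.length >= MIN_ALPHA_RATIO;
}

// Overlap = shared keywords / size of the smaller keyword set (assumption).
function keywordOverlap(a: string[], b: string[]): number {
  const setB = new Set(b.map((k) => k.toLowerCase()));
  const uniqueA = Array.from(new Set(a.map((k) => k.toLowerCase())));
  const shared = uniqueA.filter((k) => setB.has(k)).length;
  const smaller = Math.min(uniqueA.length, setB.size);
  return smaller === 0 ? 0 : shared / smaller;
}

// Gate 2: treat the new block as an update to an existing one.
function isDuplicate(a: string[], b: string[]): boolean {
  return keywordOverlap(a, b) >= DEDUP_OVERLAP;
}

console.log(passesQualityGate("Always run tests before deploy"));
console.log(isDuplicate(["deploy", "tests"], ["lunch", "menu"]));
```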

Quick Start

Step 1: Install

npm install void-memory

Step 2: Add to your MCP config

Claude Code (~/.claude/settings.local.json), Cursor (.cursor/mcp.json), Windsurf, Continue.dev, Cline — all use the same config:

{
  "mcpServers": {
    "void-memory": {
      "command": "node",
      "args": ["node_modules/void-memory/dist/mcp-server.js"],
      "env": {
        "VOID_DATA_DIR": "./memory"
      }
    }
  }
}

The full install guide covers 11 platforms, including OpenClaw, ChatGPT, Ollama, Docker, and multi-agent setups.

Step 3: Add to your CLAUDE.md (the magic)

This is what makes it sticky. Add this to your project's CLAUDE.md and your agent will automatically restore its memory on every session start and after every auto-compact:

## Memory — Void Memory

You have persistent memory via Void Memory MCP tools.

**On every session start and after every auto-compact:**
1. Run `void_recall("who am I, what am I working on")` to restore your identity and context
2. Run `void_stats()` to verify memory health (should show block counts)

**While working:**
- When you learn something important, store it: `void_store({content: "...", keywords: [...], category: "fact"})`
- Before making changes to systems you've worked on before, recall first: `void_recall("topic")`
- When corrected, store the correction immediately, before you apply the fix

**Categories:** fact, preference, context, skill, episode, decision

Your memories persist across sessions. You are not starting from zero.

That's it. Your agent now remembers who it is. You now have 5 tools:

| Tool | Description |
| --- | --- |
| `void_recall` | Query memory with three-pass pipeline |
| `void_store` | Store knowledge (quality-gated, deduped) |
| `void_stats` | Memory health dashboard |
| `void_zones` | See what the void is filtering |
| `void_explain` | Understand the system |

With Any HTTP Client (REST API)

# Start the dashboard server
npx void-memory-dashboard  # runs on port 3410

# Recall
curl -X POST http://localhost:3410/api/recall \
  -H "Content-Type: application/json" \
  -d '{"query": "deployment process", "budget": 2000}'

# Store
curl -X POST http://localhost:3410/api/store \
  -H "Content-Type: application/json" \
  -d '{"content": "Always run tests before deploy", "keywords": ["deploy", "tests"], "category": "skill"}'

# Stats
curl http://localhost:3410/api/stats
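
The same recall request can be prepared from TypeScript. The sketch below only builds the request, using the port, path, and JSON fields from the curl commands above; the helper name is hypothetical and the response format is not covered here.

```typescript
const BASE_URL = "http://localhost:3410"; // port from the dashboard command above

// Build the URL and options for POST /api/recall.
function buildRecallRequest(query: string, budget?: number) {
  const payload: { query: string; budget?: number } = { query };
  if (budget !== undefined) payload.budget = budget;
  return {
    url: `${BASE_URL}/api/recall`,
    init: {
      method: "POST",
      headers: { "Content-Type": "application/json" },
      body: JSON.stringify(payload),
    },
  };
}

const req = buildRecallRequest("deployment process", 2000);
console.log(req.url, req.init.body);
```

On a runtime with a global `fetch` (for example Node 18 or later), the result can be passed straight through as `fetch(req.url, req.init)`.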

Programmatic (TypeScript/JavaScript)

import { openDB } from 'void-memory/db';
import { recall, store, stats } from 'void-memory/engine';

const db = openDB('./my-memory');

// Store knowledge
store(db, {
  content: 'The deploy script lives at /scripts/deploy.sh',
  keywords: ['deploy', 'script', 'location'],
  category: 'fact',
});

// Recall with void filtering
const result = recall(db, 'how do I deploy?', 2000);
console.log(result.blocks);        // relevant memories
console.log(result.void_zones);    // what was suppressed
console.log(result.void_fraction); // ~0.30

Architecture

┌─────────────────────────────────────────────┐
│                 void-memory                  │
├─────────────┬─────────────┬─────────────────┤
│  MCP Server │  REST API   │  Direct Import  │
│   (stdio)   │  (HTTP)     │  (TypeScript)   │
├─────────────┴─────────────┴─────────────────┤
│              Engine (engine.ts)               │
│  TF-IDF → Void Marking → Budget Fit          │
├──────────────────────────────────────────────┤
│              SQLite (db.ts)                   │
│  blocks | recall_log | inhibitions            │
└──────────────────────────────────────────────┘

Zero external dependencies beyond SQLite. No embedding models, no vector databases, no API keys.

- engine.ts: 517 lines. Three-pass recall, store with quality gates, stats, void zones.
- db.ts: 89 lines. Schema, migrations, SQLite setup.
- mcp-server.ts: 240 lines. MCP JSON-RPC over stdio.

Total: ~850 lines of TypeScript.

Why Not Just Use RAG?

| Feature | Void Memory | Standard RAG |
| --- | --- | --- |
| Noise filtering | Active void suppression | Threshold cutoff only |
| Context budget | Hard token limit, never overflows | Hope for the best |
| Corrections | Inhibitory blocks suppress outdated info | Old info persists |
| Speed | <10ms (no embeddings) | 50-500ms (embedding + vector search) |
| Dependencies | SQLite only | Embedding model + vector DB |
| Explainability | void_zones shows what was filtered | Black box similarity scores |

Configuration

Environment variables:

| Variable | Default | Description |
| --- | --- | --- |
| `VOID_DATA_DIR` | `./data` | Directory for SQLite database |

Engine constants (in engine.ts):

| Constant | Value | Description |
| --- | --- | --- |
| `DEFAULT_BUDGET` | 4000 tokens | Default recall budget (~2% of 200K context) |
| `MAX_BUDGET` | 10000 tokens | Maximum recall budget |
| `VOID_TARGET` | 0.30 | Target void fraction (30%) |
| `MAX_CANDIDATES` | 100 | Max blocks scored per recall |
| `CLUSTER_THRESHOLD` | 0.25 | Jaccard similarity for topic clustering |
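
The two budget constants interact in a simple way, sketched below. Whether the engine clamps an oversized request to `MAX_BUDGET` or rejects it is not stated here, so the clamping behavior and the function name are assumptions.

```typescript
const DEFAULT_BUDGET = 4000;   // tokens, from the table above
const MAX_BUDGET = 10000;      // tokens, from the table above
const CONTEXT_WINDOW = 200000; // the 200K context the default is sized for

// Fall back to the default when no usable budget is given; cap at the maximum.
function resolveBudget(requested?: number): number {
  if (requested === undefined || requested <= 0) return DEFAULT_BUDGET;
  return Math.min(Math.floor(requested), MAX_BUDGET);
}

// The default is 2% of a 200K-token window.
const defaultShare = DEFAULT_BUDGET / CONTEXT_WINDOW;
console.log(defaultShare, resolveBudget(), resolveBudget(2000), resolveBudget(50000));
```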

The Science

The 30% void fraction is not arbitrary. In our Ternary PNN research:

- Binary networks (0/1) on 100x100 grids: 17.5% accuracy
- Ternary networks (+1/0/-1) with learned void: 76.5% accuracy (p = 2.18e-11)
- Void fraction stabilizes at ~28-30% across all random seeds
- The void is a topological attractor: the system finds it regardless of initialization

The same principle applies to memory: by actively carving out 30% structural absence, the remaining 70% flows through interference-free channels.

Multi-Agent Support

Each agent gets its own isolated memory via VOID_DATA_DIR:

VOID_DATA_DIR=./agent-alpha node dist/mcp-server.js  # Agent 1
VOID_DATA_DIR=./agent-beta  node dist/mcp-server.js  # Agent 2

Independent blocks, confidence tracking, and recall history per agent. No cross-contamination.
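
For teams that generate agent configs programmatically, the per-agent isolation can be sketched like this. It reuses the MCP config shape from Quick Start and the `./agent-<name>` directory pattern from the commands above; the helper name is hypothetical.

```typescript
interface McpConfig {
  mcpServers: {
    "void-memory": {
      command: string;
      args: string[];
      env: { VOID_DATA_DIR: string };
    };
  };
}

// One config per agent; only the data directory differs.
function configFor(agentName: string): McpConfig {
  return {
    mcpServers: {
      "void-memory": {
        command: "node",
        args: ["node_modules/void-memory/dist/mcp-server.js"],
        env: { VOID_DATA_DIR: `./agent-${agentName}` },
      },
    },
  };
}

const alpha = configFor("alpha");
const beta = configFor("beta");
console.log(JSON.stringify(alpha, null, 2));
```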

Docker

FROM node:22-slim
WORKDIR /app
COPY package.json package-lock.json ./
RUN npm ci --omit=dev
COPY dist/ ./dist/
ENV VOID_DATA_DIR=/app/data
CMD ["node", "dist/mcp-server.js"]

docker build -t void-memory .
# -i keeps stdin open, which the stdio-based MCP server needs
docker run -i -v ./data:/app/data void-memory

Production Stats

Running in production with 2,884 blocks across 4 AI agents:

| Metric | Value |
| --- | --- |
| Avg recall latency | 23.6ms |
| Avg void fraction | 36% |
| Total recalls | 104+ |
| Database size | ~2MB for 2,884 blocks |
| Engine size | 517 lines TypeScript |
| Runtime dependencies | 1 (`better-sqlite3`) |

License

Business Source License 1.1 — free for non-commercial use. Commercial licenses available.

Becomes MIT on 2028-03-10.

Credits

Built by Gavin Saunders and the NeoGate AI team (Tron, Arch, Flynn).
