
MHN AI Agent Memory

Associative Memory for AI Agents Using Modern Hopfield Networks


A Python library for deterministic associative memory in AI agents.

Install · Quick Start · Features · How It Works · API Reference


          ┌─────────────────────────────────────────────┐
          │                                             │
          │   xi_new = X @ softmax( beta * X^T @ xi )  │
          │                                             │
          │   One equation. One matrix multiply.        │
          │   Deterministic. Microseconds. Free.        │
          │                                             │
          └─────────────────────────────────────────────┘

What is this?

Many agent memory systems store text in a database, then ask an LLM to interpret it at query time:

  1. Store text in a database
  2. Call an LLM to reason about it (costs money, takes seconds, non-deterministic)
  3. Hope the LLM doesn't hallucinate about what it stored

This library does something else: it applies the Modern Hopfield Network update rule directly, exposing transformer-style attention as an explicit, controllable memory.

               Traditional (LLM + DB)        This Library
  Retrieval    LLM API call                  One matrix multiply
  Latency      Seconds                       Microseconds
  Cost         Per-token                     Zero after storage
  Determinism  Non-deterministic             Deterministic
  Capacity     Depends on embedding quality  Exponential in dimension (proven)

Store a fact as a pattern in the energy landscape. Query with a partial cue, and the network relaxes toward the nearest stored pattern. The retrieval weight gives you a confidence signal.
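
The boxed update rule above is all the math there is. A minimal NumPy sketch of that equation (an illustration of the update rule, not the library's internals):

import numpy as np

def softmax(z):
    z = z - z.max()                        # shift for numerical stability
    e = np.exp(z)
    return e / e.sum()

def hopfield_update(X, xi, beta=8.0):
    """X: (d, N) matrix whose columns are stored patterns; xi: (d,) query cue."""
    weights = softmax(beta * (X.T @ xi))   # attention over stored patterns
    return X @ weights                     # moves toward the best-matching pattern

rng = np.random.default_rng(0)
X = rng.standard_normal((64, 3))                 # three random 64-d patterns
cue = X[:, 1] + 0.3 * rng.standard_normal(64)    # noisy copy of pattern 1
retrieved = hopfield_update(X, cue)
print(np.argmax(X.T @ retrieved))                # -> 1: the cue snaps back to pattern 1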


Install

pip install mhn-ai-agent-memory

Optional extras:

pip install mhn-ai-agent-memory[semantic]   # sentence-transformers (~80MB model)
pip install mhn-ai-agent-memory[openai]     # OpenAI embedding API
pip install mhn-ai-agent-memory[scale]      # FAISS for million-scale storage
pip install mhn-ai-agent-memory[all]        # everything

30-Second Example

from hopfield_memory import HopfieldMemory

mem = HopfieldMemory()

mem.store("Alice is a mathematician who studies topology")
mem.store("Bob is a painter who works with oil on canvas")
mem.store("Carol is a physicist researching quantum entanglement")

fact, confidence = mem.query_with_confidence("topology math")
print(fact)        # "Alice is a mathematician who studies topology"
print(confidence)  # 0.9999

No API keys, database, or config files. The memory is a NumPy array.


Features at a Glance

Pluggable Encoders

Choose an encoder based on quality, cost, and dependencies.

from hopfield_memory import HopfieldMemory, SentenceTransformerEncoder

mem = HopfieldMemory(encoder=SentenceTransformerEncoder())

  Encoder                       Quality                     Dependencies
  RandomIndexEncoder            Basic (exact word match)    numpy only
  TFIDFEncoder                  Medium                      scikit-learn
  SentenceTransformerEncoder    High                        sentence-transformers
  OpenAIEncoder                 Highest                     openai SDK + API key

Contradiction Detection

Flag likely conflicts before storing new facts.

from hopfield_memory import check_and_store, ContradictionDetector

detector = ContradictionDetector()
idx, conflict = check_and_store(mem, "The capital of France is Lyon", detector=detector)
if conflict.has_conflict:
    print(conflict.explanation)

Multi-Hop Retrieval

Follow related facts across multiple retrieval steps.

from hopfield_memory import chain_query

mem.store("Alice lives in France")
mem.store("The capital of France is Paris")

chain_query(mem, "capital of Alice's country", max_hops=3)
# -> ["Alice lives in France", "The capital of France is Paris"]

Scale from 10 to 10 Million Facts

Use a small in-memory setup for local tools, or add tiered storage when the fact set grows.

from hopfield_memory import small_memory, large_memory, massive_memory

mem = small_memory()    # ~100 facts, for tools and plugins
mem = large_memory()    # ~100k facts, tiered hot/cold storage
mem = massive_memory()  # millions, FAISS-backed cold store

Hot memories stay in the exact Hopfield network. Colder memories can move to an approximate nearest-neighbor index on disk.
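
Conceptually, a lookup tries the hot tier first and falls back to the cold index. A rough sketch of that flow, assuming FAISS for the cold store (the name tiered_lookup and the 0.8 threshold are made up for illustration; this is not the library's actual tiering code):

import numpy as np
import faiss  # installed via the [scale] extra

def unit(x):
    return x / np.linalg.norm(x, axis=-1, keepdims=True)

d = 64
hot = unit(np.random.randn(8, d)).astype("float32")        # small, exact tier
cold = unit(np.random.randn(10_000, d)).astype("float32")  # large, cold tier

cold_index = faiss.IndexFlatIP(d)   # exact stand-in; an approximate index
cold_index.add(cold)                # (e.g. IndexIVFFlat) fits real scale better

def tiered_lookup(query, hot_threshold=0.8):
    q = unit(query).astype("float32")
    sims = hot @ q                             # cosine similarity to hot tier
    if sims.max() >= hot_threshold:            # crude placeholder heuristic
        w = np.exp(8.0 * (sims - sims.max()))
        w /= w.sum()
        return hot.T @ w                       # exact Hopfield-style readout
    _, idx = cold_index.search(q[None, :], 1)  # approximate fallback
    return cold[idx[0, 0]]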

Repulsive Attention (Opt-In)

In the included benchmark, adding contrastive "hills" to the energy landscape via repulsive attention improved multi-step retrieval convergence by up to 17x.

mem = HopfieldMemory(repulsive=True)
mem.store("Alice is a mathematician")
mem.store_negative("Known confusable pattern to avoid")

diag = mem.diagnose("topology math")
print(diag["recommendation"])  # agents decide at runtime

"Nothing Matches" Detection

Sometimes the right answer is "nothing relevant is stored." This library exposes that case instead of forcing the closest pattern.

from hopfield_memory import HopfieldMemory

mem = HopfieldMemory()
mem.store("The Eiffel Tower is in Paris")
mem.store("Mount Fuji is in Japan")

# Returns the fact when there's a match
result = mem.query_or_none("Eiffel Tower Paris")
print(result)  # "The Eiffel Tower is in Paris"

# Returns None when nothing matches
result = mem.query_or_none("basketball playoffs score")
print(result)  # None

# For more detail, inspect the match signals
mq = mem.match_quality("basketball playoffs score")
print(mq["max_similarity"])   # ~0.14 (low -- no real word overlap)
print(mq["is_match"])         # False

Under the hood, the network stores a zero-vector sentinel pattern and combines three signals:

  • max_similarity -- raw dot product before softmax (the primary signal)
  • gap -- attention weight separation between top patterns
  • sentinel_weight -- how much attention goes to the "nothing" anchor
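
Sketched in NumPy, all three signals fall out of one forward pass. An illustration only (the function name match_signals and the exact formulas are assumptions, not the library's code):

import numpy as np

def match_signals(X, xi, beta=8.0):
    """X: (d, N) stored patterns, with a zero column at index 0 as the sentinel."""
    sims = X.T @ xi                           # raw dot products; sentinel scores 0
    w = np.exp(beta * (sims - sims.max()))
    w /= w.sum()                              # attention weights over patterns
    top2 = np.sort(w)[::-1][:2]
    return {
        "max_similarity": float(sims.max()),  # primary signal, pre-softmax
        "gap": float(top2[0] - top2[1]),      # separation between top patterns
        "sentinel_weight": float(w[0]),       # attention on the "nothing" anchor
    }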

How It Works (Plain English)

  1. You store a fact. The text is encoded into a vector and added to a matrix of stored patterns. This matrix is the memory.

  2. You query with a cue. The network scores the cue against every stored pattern, then uses a softmax to concentrate attention on the strongest matches.

  3. You get a result. The output is the stored pattern the network is attracted to. The top softmax weight is your confidence score.

Key insight: This has the same mathematical structure as a single step of transformer attention (with tied keys and values). The library exposes that operation directly as a memory system, without wrapping it in an LLM.
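
The correspondence is easy to check numerically. With tied keys and values (K = V = the stored patterns) and beta playing the role of 1/sqrt(d), one Hopfield update is exactly one attention step. A small standalone sketch:

import numpy as np

rng = np.random.default_rng(0)
d, n = 64, 5
patterns = rng.standard_normal((n, d))      # rows are stored facts (K = V)
query = patterns[2] + 0.1 * rng.standard_normal(d)
beta = 1.0 / np.sqrt(d)                     # attention's usual scaling

scores = beta * (patterns @ query)          # Q K^T / sqrt(d)
w = np.exp(scores - scores.max())
w /= w.sum()                                # softmax attention weights
attn_out = w @ patterns                     # softmax(Q K^T / sqrt(d)) V

X = patterns.T                              # Hopfield convention: columns = patterns
hopfield_out = X @ w                        # X @ softmax(beta * X^T @ xi)
print(np.allclose(attn_out, hopfield_out))  # True -- same operation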


How It Compares

                      This Library           Honcho                 Zep                    MemGPT/Letta           Vector DB + LLM
Architecture          Hopfield energy        LLM reasoning + DB     Embedding +            LLM self-editing       ANN index + LLM
                      landscape                                     temporal graph         context
Retrieval latency     ~10 us (numpy matmul)  Seconds (LLM call)     ~ms (vector search)    Seconds (LLM call)     ~ms (ANN) + seconds (LLM)
Cost per query        Zero                   LLM token cost         Zero (self-hosted)     LLM token cost         LLM token cost
Deterministic         Yes                    No                     Partially              No                     No
"No match" detection  Built-in (sentinel)    Via LLM judgment       No                     Via LLM judgment       No
Capacity theory       Exponential in dim     Unbounded (DB)         Unbounded (DB)         Context window         Unbounded (DB)
                      (proven)
Dependencies          numpy                  Python + LLM API + DB  Python + DB            Python + LLM API       Python + vector DB + LLM API
MCP server            Included               Cursor/Claude plugins  No                     No                     Custom
Best for              Fast, deterministic    Personalized long-     Session history        Autonomous context     Semantic search over documents
                      agent memory           term user models                              management

These tools solve different problems. Use this library when you need fast, deterministic, cost-free associative recall. Use the others when you need LLM-mediated reasoning about memory, user modeling, or effectively unbounded storage.


Limitations

  • The default encoder is bag-of-words. "dog" and "canine" get zero similarity without the [semantic] extra.
  • Contradiction detection is heuristic. Works best with simple factual statements.
  • Multi-hop is retrieval chaining, not logical inference. It finds related facts, not derived conclusions.
  • Confidence is relative, not absolute. Softmax always sums to 1, so query_with_confidence() reports high confidence even when nothing matches. Use query_or_none() or has_match() to detect non-matches (see the sketch after this list).
  • Adaptive beta is a heuristic. The convergence proof assumes fixed inverse temperature.
  • Exponential capacity has conditions. Requires patterns with sufficient separation in high dimension.
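
Concretely, the confidence caveat looks like this (using the API shown earlier; printed values will vary with the encoder):

from hopfield_memory import HopfieldMemory

mem = HopfieldMemory()
mem.store("The Eiffel Tower is in Paris")
mem.store("Mount Fuji is in Japan")

# Softmax always sums to 1, so even an unrelated query can get a
# confident-looking top weight:
fact, confidence = mem.query_with_confidence("basketball playoffs score")
print(confidence)   # high, despite no real match

# query_or_none() applies the match-quality checks instead:
print(mem.query_or_none("basketball playoffs score"))  # None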

Project Structure

mhn-ai-agent-memory/
  pyproject.toml              # Package config
  llms.txt                    # AI agent discoverability
  CITATION.cff                # Academic citation metadata
  src/hopfield_memory/
    network.py                # ModernHopfieldNetwork (the math)
    memory.py                 # HopfieldMemory (the user API)
    repulsive.py              # RepulsiveMHN (contrastive attention)
    encoders.py               # 4 text encoders
    contradiction.py          # Conflict detection
    multihop.py               # Chained retrieval
    tiered.py                 # Hot/cold storage for scale
    presets.py                # small/medium/large/massive factories
  mcp-server/                 # MCP server for Cursor, Claude Code, etc.
  .cursor/                    # Cursor MCP example + project skill
  tests/                      # 43 tests
  examples/                   # Runnable demos
  benchmarks/                 # A/B: baseline vs repulsive
  docs/                       # GitHub Pages blog post

Cursor: shared project working memory

Multiple agents, or multiple chats, can share one on-disk Hopfield store.

To set that up:

  1. Set HOPFIELD_STATE_PATH and HOPFIELD_AUTO_SAVE in the MCP server config.
  2. Copy .cursor/mcp.json.example to .cursor/mcp.json. If ${workspaceFolder} is not expanded in your Cursor build, replace it with absolute paths.
  3. If you keep HOPFIELD_ENCODER=sentence_transformer, install mhn-ai-agent-memory[semantic] into the MCP venv. If you want zero extra dependencies, set HOPFIELD_ENCODER=random instead.
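
For reference, the resulting .cursor/mcp.json might look roughly like the sketch below. The command, script name, paths, and env values are placeholders (assumptions, not the shipped defaults), so copy mcp.json.example rather than this:

{
  "mcpServers": {
    "hopfield-memory": {
      "command": "/abs/path/to/mcp-venv/bin/python",
      "args": ["/abs/path/to/mhn-ai-agent-memory/mcp-server/server.py"],
      "env": {
        "HOPFIELD_STATE_PATH": "/abs/path/to/project/.hopfield_state",
        "HOPFIELD_AUTO_SAVE": "true",
        "HOPFIELD_ENCODER": "random"
      }
    }
  }
}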

Then use MCP tools such as list_facts, retrieve, and query_or_none as a project-local knowledge base. See mcp-server/README.md for the full setup. The mhn-project-working-memory skill under .cursor/skills/ teaches that workflow.


Development

git clone https://github.com/shahzebqazi/mhn-ai-agent-memory.git
cd mhn-ai-agent-memory
pip install -e ".[dev]"
pytest


MIT License • Built by @shahzebqazi