Pinguide

A RAG (Retrieval-Augmented Generation) system that answers questions about pinball machines using real rulesheets and strategy guides as its knowledge base. Ask anything about rules, modes, scoring, or strategy — it finds the relevant source material and generates a grounded answer.

Live: pinguide.info

Built as a personal project. Not commercial.

What it covers

Source	What it is	Machines
Tiltforums wiki rulesheets	Community rule wikis for modern machines	~95
PAPA rulesheet archive	Classic machine rulesheets	~141
Bob's Guide	Strategy guides for classic and EM-era machines	~412
Open Pinball Database	Machine metadata: manufacturer, year, model variants	~2,366

About 15,600 indexed chunks in total.

How it works

Three steps, run once to build the knowledge base:

Scrape — four scrapers fetch rulesheets and machine metadata and save structured JSON to data/raw/
Ingest — chunks each source by section, generates embeddings via OpenAI, and stores everything in a ChromaDB vector store (~380 MB on disk)
Query — at runtime, extracts the machine name from the question, retrieves the top candidates (per game variant, to guarantee equal representation), reranks them, and passes the best 5–8 to the LLM as context

We chunk by natural section boundaries (Modes, Multiball, Strategy, etc.) rather than fixed token windows, so each chunk is about one coherent topic.

Retrieval strategy

Three techniques work together:

Game name filtering — when a question names a specific machine, retrieval is restricted to chunks from that machine only. The system handles manufacturer-prefixed titles ("Stern Godzilla" vs. "Godzilla") and expands to variants (e.g. "Trident" also retrieves "Trident 2022"). General questions with no machine name get global retrieval.

HyDE (Hypothetical Document Embeddings) — pinball questions use everyday language; rulesheets use domain-specific terminology. Before searching, the LLM writes a short hypothetical rulesheet excerpt that would answer the question, and we embed that instead of the raw question. This closes the vocabulary gap and significantly improves retrieval precision.

Reranking — after retrieval, Voyage AI's rerank-2 cross-encoder scores each (question, chunk) pair for true relevance and reorders them. The top 5 go to the LLM for single-machine questions; when multiple variants of the same machine are in scope (e.g. Metallica and Metallica Remastered), each is queried separately and the top 8 go to the LLM, ensuring both versions are covered in the answer.

The system prompt

The LLM's behavior is shaped by a system prompt, which continues to evolve as we learn what produces better answers:

You are a pinball machine expert assistant. This system covers roughly N machines
across Tiltforums, PAPA, and Bob's Guide rulesheet archives. All questions refer to
pinball machines and their rules, modes, and strategies — not to bands, films, or
other topics those names may refer to. Answer using ONLY the rulesheet excerpts
provided below.
If asked how many machines are covered or what machines are available, answer using
the count above — do not say you don't know.
If the question is just a game name with no specific question, provide a concise
overview of that machine's main modes and strategy using the excerpts, and invite a
follow-up question.
When excerpts from multiple distinct machines are present (e.g. an original and a
remastered version), you MUST structure your answer with a clearly labeled heading
for EACH machine and list each machine's information under its own heading. Never
blend information from different machines into a single undifferentiated list.
If the excerpts don't contain enough information to answer confidently, say so —
do not speculate.

The machine count (N) is injected at startup from the live vector store, so it stays accurate as new machines are added.

Tech stack

Layer	Tool
RAG framework	LlamaIndex
Vector store	ChromaDB (on disk)
Embeddings	OpenAI `text-embedding-3-small`
Reranking	Voyage AI `rerank-2`
LLM	Anthropic Claude Haiku
API	Flask + flask-cors
Frontend	Plain HTML/CSS/JS
Deployment	Railway + Cloudflare

Costs

One-time: embedding the full corpus costs about $0.25 in OpenAI credits.

Per query:

Step	Cost
HyDE generation	~$0.0001
Voyage reranking	~$0.0002
Answer generation	~$0.001
Total	~$0.001–0.002

At 10 queries a day, monthly cost is well under $1.

Project structure

pinguide-rag/
├── scraper/          # tiltforums.py, papa.py, bobs_guide.py, opdb.py
├── ingest/           # pipeline.py — chunk, embed, store
├── api/              # app.py — Flask /query endpoint
├── frontend/         # index.html — single-page UI
├── data/raw/         # scraped JSON (committed to git)
└── vector_store/     # ChromaDB data (Railway Volume — not in git)

Local setup

Prerequisites: Python 3.11, API keys for Anthropic, OpenAI, OPDB, and Voyage AI.

git clone <repo-url>
cd pinguide-rag
python3.11 -m venv venv
source venv/bin/activate
pip install -r requirements.txt
cp .env.example .env   # then fill in your keys

Scraped data is already committed, so you can go straight to ingest:

python -m ingest.pipeline   # ~$0.25 in OpenAI embedding credits
flask --app api/app run --port 5001

To re-scrape fresh data first:

python -m scraper.tiltforums
python -m scraper.papa
python -m scraper.bobs_guide
python -m scraper.opdb

Deployment

Runs on Railway as a Python web service. ChromaDB data lives on a 5 GB Railway Volume (not in git) and is updated by SSHing into the service and re-running python -m ingest.pipeline after data changes.

Build: pip install -r requirements.txt
Start: gunicorn api.app:app (single worker and 120s timeout are set in gunicorn.conf.py)
Environment variables: set all API keys in the Railway dashboard
Domain: pinguide.info via Cloudflare DNS

Planned

Playfield diagram support for layout questions ("where is the left ramp?"), which requires image embeddings and a multimodal retrieval path.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Pinguide

Table of contents

What it covers

How it works

Retrieval strategy

The system prompt

Tech stack

Costs

Project structure

Local setup

Deployment

Planned

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.claude		.claude
api		api
data/raw		data/raw
frontend		frontend
ingest		ingest
scraper		scraper
.env.example		.env.example
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
README.md		README.md
gunicorn.conf.py		gunicorn.conf.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

Pinguide

Table of contents

What it covers

How it works

Retrieval strategy

The system prompt

Tech stack

Costs

Project structure

Local setup

Deployment

Planned

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages