For a faster clone:
git clone --depth=1 git@github.com:Foundations-of-Computer-Vision/visionbook.git-
Install Quarto — download the CLI and verify with
quarto --version -
Preview the book:
quarto preview
This opens a live-reloading browser tab at
localhost:<port>.
Chapters are written in .qmd files (Quarto Markdown), which use syntax similar to Markdown with LaTeX math support. Quarto converts these to HTML for the website.
Converts 2D textbook figures into interactive 3D HTML visualizations using GPT + Three.js, with automatic planning, generation, and critique scoring.
# Backend
cd figure-platform/backend
npm install
cp .env.example .env # then paste your OpenAI API key
# Frontend
cd ../frontend
npm installEdit backend/.env:
OPENAI_API_KEY=sk-...your-key-here
Open two terminals:
# Terminal 1 — backend (port 3001)
cd figure-platform/backend && node server.js
# Terminal 2 — frontend (port 3000)
cd figure-platform/frontend && npm startThen open http://localhost:3000.
- Pick a chapter → see which figures are 3D candidates
- Generate All → plans each figure, then generates interactive HTML (runs in parallel)
- Auto-evaluate → critic scores each result on 5 rubrics (1–5) and flags failure modes
- Results tab → browse by experiment, model, chapter; compare runs side-by-side