Foundations of Computer Vision

Cloning (Large Repo)

⚠️ This repo is large!!

For a faster clone:

git clone --depth=1 git@github.com:Foundations-of-Computer-Vision/visionbook.git

Local Development

Install Quarto — download the CLI and verify with quarto --version
Preview the book:
```
quarto preview
```
This opens a live-reloading browser tab at localhost:<port>.

Content

Chapters are written in .qmd files (Quarto Markdown), which use syntax similar to Markdown with LaTeX math support. Quarto converts these to HTML for the website.

Figure Platform

Converts 2D textbook figures into interactive 3D HTML visualizations using GPT + Three.js, with automatic planning, generation, and critique scoring.

Quick Start

1. Install dependencies

# Backend
cd figure-platform/backend
npm install
cp .env.example .env        # then paste your OpenAI API key

# Frontend
cd ../frontend
npm install

2. Add your API key

Edit backend/.env:

OPENAI_API_KEY=sk-...your-key-here

3. Run

Open two terminals:

# Terminal 1 — backend (port 3001)
cd figure-platform/backend && node server.js

# Terminal 2 — frontend (port 3000)
cd figure-platform/frontend && npm start

Then open http://localhost:3000.

How It Works

Pick a chapter → see which figures are 3D candidates
Generate All → plans each figure, then generates interactive HTML (runs in parallel)
Auto-evaluate → critic scores each result on 5 rubrics (1–5) and flags failure modes
Results tab → browse by experiment, model, chapter; compare runs side-by-side

Name		Name	Last commit message	Last commit date
Latest commit History 589 Commits
.github/workflows		.github/workflows
.vscode		.vscode
_extensions		_extensions
active-reader-platform/frontend/src		active-reader-platform/frontend/src
demos		demos
docs		docs
figure-platform		figure-platform
figures		figures
prompt_experiments		prompt_experiments
scripts		scripts
src @ ec5a3fd		src @ ec5a3fd
.gitignore		.gitignore
.gitmodules		.gitmodules
2d_motion_from_3d.qmd		2d_motion_from_3d.qmd
3d_learning.qmd		3d_learning.qmd
3d_scene_understanding_single_view.qmd		3d_scene_understanding_single_view.qmd
3d_scene_understanding_stereo.qmd		3d_scene_understanding_stereo.qmd
README.md		README.md
VLMs.qmd		VLMs.qmd
_quarto.yml		_quarto.yml
all.bib		all.bib
backpropagation.qmd		backpropagation.qmd
bias_and_shift.qmd		bias_and_shift.qmd
blurring_2.qmd		blurring_2.qmd
camera_as_linsys.qmd		camera_as_linsys.qmd
color.qmd		color.qmd
conditional_generative_models.qmd		conditional_generative_models.qmd
convolutional_neural_nets.qmd		convolutional_neural_nets.qmd
copyright.qmd		copyright.qmd
data_augmentation.qmd		data_augmentation.qmd
derivatives.qmd		derivatives.qmd
elsevier-with-titles.csl		elsevier-with-titles.csl
fairness.qmd		fairness.qmd
generative_modeling_and_rep_learning.qmd		generative_modeling_and_rep_learning.qmd
generative_models.qmd		generative_models.qmd
gradient_descent.qmd		gradient_descent.qmd
graphical_models.qmd		graphical_models.qmd
homogeneous_coordinates.qmd		homogeneous_coordinates.qmd
homography.qmd		homography.qmd
how_to_do_research.qmd		how_to_do_research.qmd
how_to_give_talks.qmd		how_to_give_talks.qmd
how_to_write_papers.qmd		how_to_write_papers.qmd
image_processing_fourier.qmd		image_processing_fourier.qmd
imaging.qmd		imaging.qmd
imaging_geometry.qmd		imaging_geometry.qmd
index.qmd		index.qmd
intro_to_learning.qmd		intro_to_learning.qmd
lenses.qmd		lenses.qmd
linear_image_filtering.qmd		linear_image_filtering.qmd
motion_estimation.qmd		motion_estimation.qmd
motion_estimation_intro.qmd		motion_estimation_intro.qmd
multiview.qmd		multiview.qmd
nerf.qmd		nerf.qmd
neural_nets.qmd		neural_nets.qmd
neural_nets_as_distribution_transformers.qmd		neural_nets_as_distribution_transformers.qmd
notations.qmd		notations.qmd
object_recognition_v3.qmd		object_recognition_v3.qmd
objects.qmd		objects.qmd
optical_flow.qmd		optical_flow.qmd
part_challenges_in_learning_based_vision.qmd		part_challenges_in_learning_based_vision.qmd
part_closing_remarks.qmd		part_closing_remarks.qmd
part_foundation_image_processing.qmd		part_foundation_image_processing.qmd
part_foundation_learning.qmd		part_foundation_learning.qmd
part_foundations.qmd		part_foundations.qmd
part_generative_models_and_representation.qmd		part_generative_models_and_representation.qmd
part_image_formation.qmd		part_image_formation.qmd
part_linear_filters.qmd		part_linear_filters.qmd
part_neural_architectures.qmd		part_neural_architectures.qmd
part_on_research.qmd		part_on_research.qmd
part_sampling_and_multiscale.qmd		part_sampling_and_multiscale.qmd
part_scene_understanding.qmd		part_scene_understanding.qmd
part_statistical_image_models.qmd		part_statistical_image_models.qmd
part_understanding_geometry.qmd		part_understanding_geometry.qmd
part_understanding_motion.qmd		part_understanding_motion.qmd
perceptual_organization.qmd		perceptual_organization.qmd
pre-title.html		pre-title.html
problem_of_generalization.qmd		problem_of_generalization.qmd
pyramids_new_notation.qmd		pyramids_new_notation.qmd
recurrent_neural_nets.qmd		recurrent_neural_nets.qmd
references.qmd		references.qmd
render.yaml		render.yaml
representation_learning.qmd		representation_learning.qmd
representing_the_image.qmd		representing_the_image.qmd
sampling_and_aliasing.qmd		sampling_and_aliasing.qmd
scenes.qmd		scenes.qmd
sectionstat.html		sectionstat.html
series.qmd		series.qmd
simplesystem.qmd		simplesystem.qmd
simplesystem_final.qmd		simplesystem_final.qmd
spatial_filter_sets.qmd		spatial_filter_sets.qmd
stat_image_models_revised.qmd		stat_image_models_revised.qmd
taxonomy.qmd		taxonomy.qmd
temp.tex		temp.tex
temporal_filters_v2.qmd		temporal_filters_v2.qmd
textures.qmd		textures.qmd
transfer_learning.qmd		transfer_learning.qmd
transformers.qmd		transformers.qmd
upsamplig_downsampling_2.qmd		upsamplig_downsampling_2.qmd
visionbib.bib		visionbib.bib
visionbook.css		visionbook.css

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Foundations of Computer Vision

Cloning (Large Repo)

Local Development

Content

Figure Platform

Quick Start

1. Install dependencies

2. Add your API key

3. Run

How It Works

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Foundations of Computer Vision

Cloning (Large Repo)

Local Development

Content

Figure Platform

Quick Start

1. Install dependencies

2. Add your API key

3. Run

How It Works

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages