Human-Likeness in Chess

A small research project quantifying how "human-typical" famous chess grandmasters play, using Maia-2, a deep-learning model trained exclusively on human games to predict what a player at a given rating would do in any position.

→ Interactive visualisation + write-up

What it does

I ran Maia-2 on 18,000 moves per grandmaster (opening, middlegame, endgame) and recorded the probability the model assigned to each player's actual move. That probability is the "human-likeness" score: high means the move was typical of how humans (at that skill level) play, low means it was more unusual.

Plotting each player as a point in (opening, middlegame, endgame) human-likeness space reveals a strong latent structure: a single axis (PC1) explains ~58% of the variance, and it runs consistently across all three phases.

Note: The trend is better visualized through the interactive 3D plot. Click the image to open the website.

And here are those values collapsed onto PC1 (least to most human-like):

Key findings

One latent axis dominates. A single "human-likeness" dimension accounts for the majority of variance across all three game phases, which is non-trivial given how different opening theory, middlegame tactics, and endgame technique are.
Orthogonal to skill and era. Peak ELO and era of play both show essentially zero correlation with human-likeness (R² ≈ 0.009 and 0.016). High human-likeness is not just "weaker player" or "older era."
Acts as a fingerprint. Compressing each player down to three humanness values preserves enough identity signal that a k-NN classifier re-identifies players from held-out games at ~55% top-5 accuracy (vs. 9% random baseline).

Reproducing the results

This repo uses Git LFS for some of the larger files. Please ensure you have Git LFS installed before cloning.

All scripts are run from the project root:

# 1. Download PGN files (~65 grandmasters from pgnmentor.com)
python -m pipeline.fetch

# 2. Parse PGNs and segment moves by game phase (cached after first run)
python -m pipeline.parse

# 3. Run Maia-2 inference on training moves (slow)
python -m pipeline.inference

# 4. Visualise
python -m analysis.plot_interactive     # → outputs/plot_means_interactive.html
python -m analysis.plot_means           # → outputs/regression_means_plot.png
python -m analysis.plot_pc1             # → outputs/pc1_projection.png

# 5. Stylometric verification (cached after first run)
python -m analysis.stylometry_3d
python -m analysis.stylometry_6d

Cached files are already present in the repo.

Dependencies

pip install -r requirements.txt

License and Usage

All rights reserved. I'm sharing this repository publicly as part of my portfolio to showcase my current progress and methodology. Since this is an ongoing, unpublished project, I kindly ask that you do not copy or reuse the code or findings just yet. If you're interested in the project or want to discuss the approach, feel free to reach out!

Mattia Greiche — mattia.greiche@mail.mcgill.ca

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
analysis		analysis
data		data
maia2_models		maia2_models
outputs		outputs
pipeline		pipeline
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
index.html		index.html
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Human-Likeness in Chess

What it does

Key findings

Reproducing the results

Dependencies

License and Usage

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Human-Likeness in Chess

What it does

Key findings

Reproducing the results

Dependencies

License and Usage

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages