synth-lib

Tools for downloading minute-level market data and backtesting miners against Synth Subnet scoring.

The library is organised into two pieces:

app/lib/preparation/market_data.py — downloads and stores minute closes as daily parquet partitions.
app/lib/backtester/backtest.py — scores a miner's predictions against real price paths and computes smoothed scores / reward weights using the same logic the live validator runs.

Requirements

Python ≥ 3.12
uv for dependency management

Install dependencies once:

uv sync

1. Download market data

Data is fetched from Pyth (spot equities, commodities, majors) or Hyperliquid (perps) depending on the asset, and cached under market_data/pyth/{ASSET}/1m/date=YYYY-MM-DD.parquet. Existing finalised partitions are skipped unless --force-refresh is passed.
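The cache layout above can be expressed as a small path helper. This is a minimal sketch, not the library's own code: `partition_path`, `is_cached`, and `MARKET_DATA_ROOT` are hypothetical names, and the only thing taken from this README is the `market_data/pyth/{ASSET}/1m/date=YYYY-MM-DD.parquet` pattern.

```python
from datetime import date
from pathlib import Path

# Assumed root directory; the README only documents the relative layout.
MARKET_DATA_ROOT = Path("market_data")


def partition_path(asset: str, day: date, root: Path = MARKET_DATA_ROOT) -> Path:
    """Return the expected location of one daily minute-close partition."""
    return root / "pyth" / asset / "1m" / f"date={day.isoformat()}.parquet"


def is_cached(asset: str, day: date, root: Path = MARKET_DATA_ROOT) -> bool:
    """True if a partition file already exists on disk for that day."""
    return partition_path(asset, day, root).is_file()


print(partition_path("BTC", date(2025, 1, 15)).as_posix())
# market_data/pyth/BTC/1m/date=2025-01-15.parquet
```

A helper like this is handy for checking coverage before a backtest, since the backtester reads prices only from local partitions.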

All supported assets, default 15-month window

uv run app/lib/preparation/market_data.py

This downloads 15 months of history ending yesterday. The asset list comes from synth.validator.price_data_provider.PriceDataProvider and includes majors such as BTC, ETH, SOL, plus tokenised equities/commodities (XAU, SPYX, NVDAX, TSLAX, AAPLX, GOOGLX).

A single asset

uv run app/lib/preparation/market_data.py --asset BTC

Recent N days (includes today)

# Last 3 days for BTC only, including today
uv run app/lib/preparation/market_data.py --asset BTC --days 3

# Last 7 days across every asset
uv run app/lib/preparation/market_data.py --days 7

--days N anchors the window at today (inclusive). Today's partition is marked is_final=False and is re-downloaded on every subsequent run.
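The window and skip rules described above can be sketched as follows. The function names are hypothetical, and using `day == today` as a stand-in for `is_final=False` is an assumption; the real downloader may track finality differently.

```python
from datetime import date, timedelta


def recent_days_window(days: int, today: date) -> list[date]:
    """Dates covered by `--days N`: N calendar days ending today (inclusive)."""
    if days < 1:
        raise ValueError("days must be >= 1")
    return [today - timedelta(days=offset) for offset in range(days - 1, -1, -1)]


def needs_download(day: date, today: date, cached: bool, force_refresh: bool = False) -> bool:
    """Skip cached days unless forced; today's partition is treated as never final."""
    return force_refresh or not cached or day == today


today = date(2026, 3, 28)
window = recent_days_window(3, today)
print([d.isoformat() for d in window])
# ['2026-03-26', '2026-03-27', '2026-03-28']
```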

Re-download existing partitions

uv run app/lib/preparation/market_data.py --asset BTC --force-refresh

CLI flags

| Flag | Default | Description |
| --- | --- | --- |
| `--asset` | all assets | Single asset symbol; omit to download every supported asset |
| `--days` | (15-month default window) | Download N days ending today (inclusive); overrides the default window |
| `--force-refresh` | off | Re-download partitions that already exist on disk |

From Python

from app.lib.preparation.market_data import download_market_data, download_all_assets

# Single asset, default window
download_market_data("BTC")

# Recent N days, every asset
download_all_assets(days=30)

Output layout

market_data/
└── pyth/
    └── BTC/
        └── 1m/
            ├── date=2025-01-15.parquet
            ├── date=2025-01-16.parquet
            └── ...

Each parquet file contains timestamp, close, source, ingested_at, and is_final columns. Rows are minute-aligned in UTC, and minutes with no data are stored as NaN.
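To illustrate what "minute-aligned with NaN gaps" means, here is a standard-library sketch that maps sparse observations onto a full one-day grid. `minute_grid` and `align_closes` are hypothetical helpers, not part of the library, and the prices are made up.

```python
import math
from datetime import datetime, timedelta, timezone


def minute_grid(day_start: datetime) -> list[datetime]:
    """All 1440 minute-aligned UTC timestamps of one day."""
    return [day_start + timedelta(minutes=i) for i in range(24 * 60)]


def align_closes(day_start: datetime, closes: dict[datetime, float]) -> list[float]:
    """Map observed closes onto the full grid; missing minutes become NaN."""
    return [closes.get(ts, math.nan) for ts in minute_grid(day_start)]


day = datetime(2025, 1, 15, tzinfo=timezone.utc)
observed = {day: 100.0, day + timedelta(minutes=2): 101.5}  # minute 1 is missing
aligned = align_closes(day, observed)
print(len(aligned), aligned[:3])
# 1440 [100.0, nan, 101.5]
```

Keeping the gaps explicit means every partition has the same number of rows, so downstream code can index by minute offset without re-aligning.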

2. Run a backtest

A backtest compares a miner's prediction files to real prices, computes CRPS per prompt, and then replays the validator's smoothed-score / reward-weight calculation to produce the miner's rank over time.
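For readers unfamiliar with CRPS, the textbook ensemble estimator for a single scalar outcome is sketched below. This is only the generic definition: this README does not spell out the validator's formula, which is applied to whole price paths and changed for HIGH_FREQUENCY on 2026-03-11, so treat `crps_ensemble` as an illustration and not as the scoring code.

```python
def crps_ensemble(samples: list[float], observed: float) -> float:
    """Empirical CRPS: E|X - y| - 0.5 * E|X - X'| over the ensemble members."""
    n = len(samples)
    if n == 0:
        raise ValueError("need at least one sample")
    # Average distance between each simulated value and the realised value.
    accuracy = sum(abs(x - observed) for x in samples) / n
    # Average distance between all pairs of simulated values.
    spread = sum(abs(a - b) for a in samples for b in samples) / (n * n)
    return accuracy - 0.5 * spread


print(round(crps_ensemble([1.0, 2.0, 3.0], 2.0), 4))
# 0.2222
```

Lower is better: the first term rewards being close to the realised value, and the second term stops an overly narrow ensemble from being scored as if it were a point forecast with no uncertainty.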

Prediction files

The backtester reads prediction files from miner_outputs/{miner_name}/predictions/**/*.json. Filenames must follow:

YYYY-MM-DD_HH:MM:SSZ_{ASSET}_{time_length}.json

For example: 2026-03-28_00:00:00Z_BTC_86400.json. Two JSON layouts are accepted (see load_prediction):

Flat notebook format: {"start_timestamp", "asset", "time_increment", "time_length", "paths", ...}
ArtifactManager format: {"simulation_input": {...}, "prediction": [meta, meta, path, ...]}

paths is a num_simulations × (num_steps + 1) array of simulated price paths starting from the current price.
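The naming rule and array shape can be checked before running a backtest. The sketch below is not `load_prediction`; `parse_prediction_filename` and `expected_points` are hypothetical helpers, and the relation `num_steps = time_length / time_increment` is an assumption that this README implies but does not state.

```python
import re
from datetime import datetime, timezone
from typing import NamedTuple

FILENAME_RE = re.compile(
    r"^(?P<start>\d{4}-\d{2}-\d{2}_\d{2}:\d{2}:\d{2})Z"
    r"_(?P<asset>[A-Za-z0-9]+)_(?P<time_length>\d+)\.json$"
)


class PredictionName(NamedTuple):
    start: datetime
    asset: str
    time_length: int


def parse_prediction_filename(name: str) -> PredictionName:
    """Split 'YYYY-MM-DD_HH:MM:SSZ_{ASSET}_{time_length}.json' into its parts."""
    match = FILENAME_RE.match(name)
    if match is None:
        raise ValueError(f"unexpected prediction filename: {name!r}")
    start = datetime.strptime(match["start"], "%Y-%m-%d_%H:%M:%S")
    return PredictionName(
        start.replace(tzinfo=timezone.utc), match["asset"], int(match["time_length"])
    )


def expected_points(time_length: int, time_increment: int) -> int:
    """Assumed: num_steps = time_length / time_increment, plus the starting price."""
    return time_length // time_increment + 1


parsed = parse_prediction_filename("2026-03-28_00:00:00Z_BTC_86400.json")
print(parsed.asset, parsed.time_length, expected_points(parsed.time_length, 300))
# BTC 86400 289
```

Under that assumption, a 24h prompt with 5-minute steps needs 289 prices per simulated path.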

Running the backtester

Targets are expressed as a pair: which frequency profile to run (--profile) and which asset(s) to run within it (--asset). Defaults are --profile all --asset ALL, i.e. every asset in every profile. If no prediction files exist under miner_outputs/{miner_name}/predictions/, the script auto-generates random-walk predictions into a temp directory so the pipeline can be verified end-to-end.

# Default: both profiles × every asset, last 2 days
uv run app/lib/backtester/scripts/run_backtest.py --miner-name gbm_agent

# BTC across both profiles
uv run app/lib/backtester/scripts/run_backtest.py --miner-name gbm_agent --asset BTC

# Multiple assets in LOW_FREQUENCY (space-, comma-, or quoted list all work)
uv run app/lib/backtester/scripts/run_backtest.py --miner-name gbm_agent --profile low --asset BTC ETH TSLAX
uv run app/lib/backtester/scripts/run_backtest.py --miner-name gbm_agent --profile low --asset BTC,ETH,TSLAX
uv run app/lib/backtester/scripts/run_backtest.py --miner-name gbm_agent --profile low --asset "BTC ETH TSLAX"

# Every asset in HIGH_FREQUENCY only
uv run app/lib/backtester/scripts/run_backtest.py --miner-name gbm_agent --profile high --asset ALL

# Longer window or custom predictions directory
uv run app/lib/backtester/scripts/run_backtest.py \
    --miner-name gbm_agent \
    --days 7 \
    --profile low --asset BTC \
    --predictions-dir /path/to/predictions

Assets not in a given profile's asset_list are filtered out for that profile; if nothing matches anywhere the script exits with a clear error listing the supported assets per profile.
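The accepted `--asset` spellings and the per-profile filtering can be sketched like this. It is not the script's actual parser: `normalize_assets` and `select_assets` are hypothetical, and the two asset lists are made-up placeholders, not the real contents of either profile.

```python
import re


def normalize_assets(values: list[str]) -> list[str]:
    """Flatten ['BTC,ETH', 'TSLAX'] or ['BTC ETH TSLAX'] into unique upper-case symbols."""
    symbols: list[str] = []
    for value in values:
        for token in re.split(r"[,\s]+", value.strip()):
            if token and token.upper() not in symbols:
                symbols.append(token.upper())
    return symbols


def select_assets(requested: list[str], profile_assets: list[str]) -> list[str]:
    """'ALL' keeps the whole profile; otherwise keep only assets the profile supports."""
    if requested == ["ALL"]:
        return list(profile_assets)
    return [asset for asset in requested if asset in profile_assets]


# Placeholder asset lists, for illustration only.
low_assets = ["BTC", "ETH", "TSLAX"]
high_assets = ["BTC", "ETH"]

requested = normalize_assets(["BTC,ETH", "tslax"])
print(select_assets(requested, low_assets))   # ['BTC', 'ETH', 'TSLAX']
print(select_assets(requested, high_assets))  # ['BTC', 'ETH']
```

If every profile ends up with an empty selection, that corresponds to the case where the script exits with an error.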

CLI flags (see run_backtest.py):

| Flag | Default | Description |
| --- | --- | --- |
| `--miner-name` | `btc_research` | Subdirectory under `miner_outputs/` for predictions and chart output |
| `--days` | `2` | Length of the backtest window (ending now) |
| `--profile` | `all` | Profile to backtest: `low`, `high`, or `all` |
| `--asset` | `ALL` | One or more asset symbols (space-, comma-, or space-inside-quotes separated), or `ALL` for every asset in the selected profile(s) |
| `--predictions-dir` | `miner_outputs/{miner_name}/predictions` | Where to read predictions from |

Parallelism

The runner uses three nested pools, so a full sweep finishes much faster than the asset count would suggest:

Profiles (low, high) run in a ThreadPoolExecutor (one thread per profile).
Inside each profile, assets run in their own ThreadPoolExecutor (up to 6 at a time, I/O-bound on the Synth API).
Inside each asset, per-prompt CRPS scoring is dispatched to a shared ProcessPoolExecutor sized to cpu_count() - 2.
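The nesting described above can be sketched with `concurrent.futures`. This is an illustration of the pattern, not the runner's code: `score_prompt` is a placeholder for CRPS scoring, the prompt IDs are made up, and a thread pool stands in for the shared scoring pool so the snippet runs anywhere.

```python
import os
from concurrent.futures import Executor, ThreadPoolExecutor


def score_pool_size() -> int:
    """cpu_count() - 2, but never below one worker."""
    return max(1, (os.cpu_count() or 1) - 2)


def score_prompt(prompt_id: int) -> float:
    """Placeholder for the CPU-bound per-prompt scoring step."""
    return float(prompt_id * prompt_id)


def run_asset(asset: str, prompt_ids: list[int], score_pool: Executor) -> tuple[str, float]:
    """Innermost level: fan prompts out to the shared scoring pool."""
    futures = [score_pool.submit(score_prompt, p) for p in prompt_ids]
    return asset, sum(f.result() for f in futures)


def run_profile(assets: dict[str, list[int]], score_pool: Executor) -> dict[str, float]:
    """Middle level: up to 6 assets at a time, each in its own thread."""
    with ThreadPoolExecutor(max_workers=6) as asset_pool:
        futures = [asset_pool.submit(run_asset, a, p, score_pool) for a, p in assets.items()]
        return dict(f.result() for f in futures)


def run_sweep(profiles: dict[str, dict[str, list[int]]], score_pool: Executor) -> dict[str, dict[str, float]]:
    """Outer level: one thread per profile."""
    with ThreadPoolExecutor(max_workers=max(1, len(profiles))) as profile_pool:
        futures = {n: profile_pool.submit(run_profile, a, score_pool) for n, a in profiles.items()}
        return {name: future.result() for name, future in futures.items()}


demo = {"low": {"BTC": [1, 2, 3], "ETH": [4]}, "high": {"BTC": [5]}}
with ThreadPoolExecutor(max_workers=score_pool_size()) as pool:  # stand-in for the process pool
    result = run_sweep(demo, pool)
print(result)
# {'low': {'BTC': 14.0, 'ETH': 16.0}, 'high': {'BTC': 25.0}}
```

Because every function takes an `Executor`, the same code would accept a `ProcessPoolExecutor(max_workers=score_pool_size())` created under an `if __name__ == "__main__":` guard. Threads suit the outer two levels because they mostly wait on the network, while scoring is CPU-bound and benefits from separate processes.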

From Python

from synth.validator.prompt_config import LOW_FREQUENCY
from app.lib.backtester.backtest import run_backtest, backtest

# Single asset
single = backtest(
    miner_name="gbm_agent",
    asset="BTC",
    time_length=86_400,     # 24h prompts (LOW_FREQUENCY) — use 3_600 for HIGH_FREQUENCY
    time_increment=300,     # 5-minute steps
    n_backtest_days=7,
)

# Whole profile (parallel across assets + emits per-profile TOTAL charts when ≥2 succeed)
results, combined = run_backtest(
    miner_name="gbm_agent",
    prompt_config=LOW_FREQUENCY,
    n_backtest_days=7,
)

print(single.summary)       # {num_prompts, mean_crps, final_smoothed_score, ...}
single.prompt_df            # per-prompt CRPS and scores (incl. every other miner)
single.smoothed_scores      # per-round smoothed score + reward_weight

Output

Charts are written to miner_outputs/{miner_name}/charts/:

Per-asset:

rank_evolution_{asset}_{time_length}.png — rank over time (1 = best)
crps_over_time_…png, crps_by_hour_…png, crps_by_day_…png
crps_ratio_dist_…png — distribution of your CRPS relative to median
weekly_percentile_…png — percentile rank per calendar week

Per profile (emitted when ≥2 assets in that profile produce results):

rank_evolution_TOTAL_{profile}.png — combined rank across the profile's assets
estimated_earnings_{profile}.png — per-round USD + cumulative earnings estimate

Grand total (emitted when both profiles produced data):

rank_evolution_GRAND_TOTAL.png
estimated_earnings_GRAND_TOTAL.png

The console prints per-asset rank, reward weight, smoothed score, prompt count, mean CRPS, and the paths of every saved chart.

Known caveats

HIGH_FREQUENCY CRPS formula change on 2026-03-11

On 2026-03-11 the Synth validator changed how it computes CRPS for the HIGH_FREQUENCY profile. CRPS values that the API returns for prompts scored before that date were produced by the previous formula and are not directly comparable to the values the validator computes today. In practice, a HIGH_FREQUENCY backtest whose window includes any pre-cutoff prompts mixes two scoring regimes, so the resulting ranks and reward weights won't match what the same predictions would receive on the live network now.

The smoothing window also looks back prompt_config.window_days (3 days for HIGH_FREQUENCY), so the smoothed score remains contaminated by pre-cutoff CRPS until 2026-03-14.

If you see the corresponding UserWarning from run_backtest, you have two ways to get current-formula ranks:

# 1. Restrict evaluation to dates on or after the formula change.
uv run app/lib/backtester/scripts/run_backtest.py \
    --miner-name gbm_agent --profile high \
    --eval-end 2026-04-15 --days 30

# 2. Simulate registering the miner on 2026-03-14 (= cutoff + window_days),
#    so the smoothing window is fully post-change.
uv run app/lib/backtester/scripts/run_backtest.py \
    --miner-name gbm_agent --profile high \
    --simulate-registration 2026-03-14

The LOW_FREQUENCY profile is unaffected.
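The date arithmetic behind this caveat is easy to check by hand. In the sketch below the constants come from the text above, while the helper names and the rule that a window spans `days` calendar days back from `--eval-end` are assumptions; the script may count the boundary days differently.

```python
from datetime import date, timedelta

HF_FORMULA_CHANGE = date(2026, 3, 11)  # formula change date from this section
HF_WINDOW_DAYS = 3                     # prompt_config.window_days for HIGH_FREQUENCY


def first_clean_date(cutoff: date = HF_FORMULA_CHANGE, window_days: int = HF_WINDOW_DAYS) -> date:
    """First date whose smoothing window contains only post-change CRPS."""
    return cutoff + timedelta(days=window_days)


def window_start(eval_end: date, days: int) -> date:
    """Assumed: the backtest window spans `days` calendar days back from eval_end."""
    return eval_end - timedelta(days=days)


def is_contaminated(eval_end: date, days: int) -> bool:
    """True if the window reaches back before the first fully post-change date."""
    return window_start(eval_end, days) < first_clean_date()


print(first_clean_date())                       # 2026-03-14
print(window_start(date(2026, 4, 15), 30))      # 2026-03-16
print(is_contaminated(date(2026, 4, 15), 30))   # False
print(is_contaminated(date(2026, 3, 20), 7))    # True
```

Under these assumptions, the `--eval-end 2026-04-15 --days 30` example starts on 2026-03-16, after the smoothing window has cleared the old formula.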

Tests

Unit and integration tests live in tests/lib/backtester/.

uv run pytest tests/lib/backtester/

Notes

The backtester pulls scored prompts and rewards history from https://api.synthdata.co. The Synth API is rate-limited and requests are retried with exponential backoff, so long multi-asset runs can be slow.
download_price_data reads exclusively from local parquet partitions. Make sure the relevant market_data/pyth/{ASSET}/1m/ directory is populated (see section 1) — the HIGH_FREQUENCY profile needs coverage up to today since its prompt window is only 1h.
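For reference, retrying with exponential backoff generally looks like the sketch below. This is a generic illustration, not the backtester's retry code: `retry_with_backoff`, its parameters, and the chosen exception types are all assumptions, and the real client may use different delays or add jitter.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def retry_with_backoff(
    call: Callable[[], T],
    *,
    retries: int = 5,
    base_delay: float = 1.0,
    max_delay: float = 60.0,
    retry_on: tuple[type[BaseException], ...] = (ConnectionError, TimeoutError),
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call `call()`, doubling the wait after each failure up to `max_delay`."""
    if retries < 0:
        raise ValueError("retries must be >= 0")
    attempt = 0
    while True:
        try:
            return call()
        except retry_on:
            if attempt == retries:
                raise  # out of attempts: surface the last error
            sleep(min(max_delay, base_delay * 2 ** attempt))
            attempt += 1


# Usage with a fake request that fails twice, recording the waits instead of sleeping.
waits: list[float] = []
outcomes = iter([ConnectionError("rate limited"), ConnectionError("rate limited"), "ok"])


def fake_request() -> str:
    outcome = next(outcomes)
    if isinstance(outcome, Exception):
        raise outcome
    return outcome


print(retry_with_backoff(fake_request, sleep=waits.append), waits)
# ok [1.0, 2.0]
```

Injecting `sleep` keeps the helper testable without real delays, and capping the wait at `max_delay` prevents a long outage from producing hour-long pauses.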
