This guide walks you through setting up and using Substrate, the infrastructure for human knowledge and progress.
- Git - To clone the repository
- A text editor or IDE
- Bun - JavaScript/TypeScript runtime for running update scripts
curl -fsSL https://bun.sh/install | bash
git clone https://github.com/danielmiessler/Substrate.git
cd Substrate# Core datasets (GDP, inflation, COVID wastewater, etc.)
ls Data/
# Wellbeing data sources (FRED, CDC, Census, BLS, EPA)
ls Data-Sources/
# Knowledge components
ls Problems/
ls Solutions/
ls Arguments/
ls Claims/If you want to run the TypeScript automation scripts:
bun installAll data is stored in human-readable formats:
- CSV files - Raw data you can open in Excel, Google Sheets, or any spreadsheet app
- Markdown files - Documentation and metadata
No special tools required - just browse the repository!
# See the data files
ls Data/US-GDP/
# View the CSV in terminal
cat Data/US-GDP/us-gdp-annual.csv | head -20
# Or open in your favorite spreadsheet app
open Data/US-GDP/us-gdp-annual.csvNote: This is optional. The data is already included in the repository.
cd Data/US-GDP
bun run update.tsMost wellbeing sources require API keys:
# Set your API key
export FRED_API_KEY="your_key_here"
# Run the update
cd Data-Sources/DS-00004—FRED_Economic_Wellbeing
bun run update.tsbun run scripts/update-all.tsMost data sources are free but require registration:
| Data Source | Get Key | Rate Limit |
|---|---|---|
| FRED Economic | fred.stlouisfed.org/docs/api | 120 req/min |
| Census ACS | api.census.gov/data/key_signup | 500 req/day |
| EPA Air Quality | Email: aqs.support@epa.gov | 10 req/min |
| BLS JOLTS | bls.gov/developers/home | 500 req/day |
| CDC WONDER | No key required | Fair use |
Create a .env file in the repository root:
FRED_API_KEY=your_fred_key
CENSUS_API_KEY=your_census_key
EPA_EMAIL=your_email@example.com
EPA_KEY=your_epa_key
BLS_API_KEY=your_bls_keyOr export them in your shell:
export FRED_API_KEY="your_fred_key"
export CENSUS_API_KEY="your_census_key"We welcome contributions! Here's how to add to Substrate:
| Type | Directory | Description |
|---|---|---|
| Problems | Problems/ |
Documented challenges with evidence |
| Solutions | Solutions/ |
Proven approaches with results |
| Arguments | Arguments/ |
Reasoning chains with quality scores |
| Claims | Claims/ |
Assertions linked to evidence |
| Data | Data/ or Data-Sources/ |
Authoritative datasets |
| People | People/ |
Researchers and practitioners |
| Organizations | Organizations/ |
Groups working on problems |
| Projects | Projects/ |
Active initiatives |
| Plans | Plans/ |
Actionable strategies |
| Ideas | Ideas/ |
Frameworks and concepts |
| Values | Values/ |
Guiding principles |
-
Fork the repository on GitHub
-
Create a branch for your contribution:
git checkout -b add-my-contribution
-
Add your content following the format in each directory's README
-
Commit your changes:
git add . git commit -m "Add: description of your contribution"
-
Push and create a Pull Request:
git push origin add-my-contribution
- Follow existing formats - Check the README in each directory for templates
- Include evidence - Link claims to data sources where possible
- Use clear IDs - Follow the naming conventions (e.g.,
PR-00001for problems) - Document your sources - Include provenance and methodology
- Be specific - Vague contributions are harder to use
We don't gatekeep ideas, but we do maintain quality:
- Problems should be specific and evidence-backed
- Solutions should include outcomes or expected results
- Arguments should have clear reasoning chains
- Data should include methodology and limitations
Substrate/
├── Data/ # Core datasets
│ ├── US-GDP/ # US GDP data (1929-present)
│ ├── US-Inflation/ # US inflation data
│ ├── Bay-Area-COVID/ # COVID wastewater monitoring
│ ├── Pulitzer-Prize/ # Pulitzer Prize winners
│ └── Knowledge-Worker-Salaries/
│
├── Data-Sources/ # Wellbeing data sources
│ ├── DS-00001—WHO_Global_Health_Observatory/
│ ├── DS-00002—UN_SDG_Indicators/
│ ├── DS-00003—World_Bank_Open_Data/
│ ├── DS-00004—FRED_Economic_Wellbeing/
│ ├── DS-00005—CDC_WONDER_Mortality/
│ ├── DS-00006—Census_ACS_Social_Wellbeing/
│ ├── DS-00007—BLS_JOLTS_Labor_Market/
│ └── DS-00008—EPA_Air_Quality_System/
│
├── Problems/ # Documented challenges
├── Solutions/ # Proven approaches
├── Arguments/ # Reasoning chains
├── Claims/ # Evidence-linked assertions
├── Plans/ # Actionable strategies
├── Ideas/ # Frameworks and concepts
├── People/ # Researchers and practitioners
├── Organizations/ # Groups working on issues
├── Projects/ # Active initiatives
├── Values/ # Guiding principles
│
├── scripts/ # Automation scripts
├── README.md # Main documentation
├── GETTING_STARTED.md # This file
├── QUICK_REFERENCE.md # Command cheatsheet
└── UPDATES.md # Changelog
- Explore the data - Browse
Data/andData-Sources/to see what's available - Read the documentation - Each dataset has its own README with methodology
- Try the automation - Run an update script to see how data is refreshed
- Contribute - Add a problem, solution, or data source you care about
- Issues - github.com/danielmiessler/Substrate/issues
- Discussions - Start a discussion in the repository
- Twitter - @danielmiessler