|
| 1 | +# AI Adoption Share by Industry and Country |
| 2 | + |
| 3 | +Industry-level AI adoption shares derived from the **MAP-AI** (Mapping Artificial Intelligence in Firms) dataset, as described in: |
| 4 | + |
| 5 | +> Garbers, Julio and Terry Gregory. "Mapping Artificial Intelligence: Evidence from Firm-Level Web Data in Europe." |
| 6 | +
|
| 7 | +The MAP-AI data underlying this study -- including firm-level indicators of AI adoption, AI ecosystem roles, and AI technology types for European firms between 2016 and 2024 -- are publicly available via GitHub: <https://github.com/MAP-AI-data/data>. |
| 8 | + |
| 9 | +## File |
| 10 | + |
| 11 | +`ai_adoption_share_by_industry_country.parquet` (also available as `.csv`) |
| 12 | + |
| 13 | +A single long-format table with **60,584 rows**, covering two industry classification systems (NACE Rev. 2 and NAICS 2022) at four digit levels, for four European countries plus a pooled aggregate, annually from 2016 to 2024. |
| 14 | + |
| 15 | +## Schema |
| 16 | + |
| 17 | +| Column | Type | Description | |
| 18 | +|---|---|---| |
| 19 | +| `classification` | string | Industry classification system: `"nace"` or `"naics"` | |
| 20 | +| `code` | string | Industry code (e.g., `"C"`, `"62"`, `"620"`, `"6201"`) | |
| 21 | +| `digit_level` | int | Aggregation level: 1, 2, 3, or 4 | |
| 22 | +| `country` | string | `"Germany"`, `"France"`, `"Luxembourg"`, `"Belgium"`, or `"Pooled"` | |
| 23 | +| `year` | int | Year of observation (2016--2024) | |
| 24 | +| `ai_adoption_share` | float | Share of firms classified as using AI (null if `n_firms` < 30) | |
| 25 | +| `n_firms` | int | Number of unique firms in the industry-country-year cell | |
| 26 | + |
| 27 | +## Coverage |
| 28 | + |
| 29 | +**Countries:** Belgium, France, Germany, Luxembourg, and a pooled aggregate combining all four. |
| 30 | + |
| 31 | +**Years:** 2016--2024 (9 years). |
| 32 | + |
| 33 | +**Industry codes:** |
| 34 | + |
| 35 | +| Classification | 1-digit | 2-digit | 3-digit | 4-digit | |
| 36 | +|---|---|---|---|---| |
| 37 | +| NACE Rev. 2 | 21 sections (A--U) | 88 divisions | 272 groups | 615 classes | |
| 38 | +| NAICS 2022 | 9 sectors | 24 subsectors | 96 industry groups | 304 industries | |
| 39 | + |
| 40 | +NACE codes follow the Eurostat NACE Rev. 2 classification. NAICS codes follow the 2022 NAICS structure published by the U.S. Census Bureau. NACE 1-digit codes correspond to section letters (A--U); higher levels are numeric. NAICS codes are numeric at all levels. |
| 41 | + |
| 42 | +## Methodology |
| 43 | + |
| 44 | +### AI adoption measure |
| 45 | + |
| 46 | +The `ai_adoption_share` column reports the share of firms in a given industry-country-year cell whose websites indicate AI usage, as identified by the MAP-AI indicator. MAP-AI uses a Large Language Model (LLM) to classify firm website content and determine whether a firm adopts AI. The indicator captures *realized* AI adoption -- firms that actively use, develop, or deploy AI technologies -- rather than potential AI exposure. See the paper for full details on the classification methodology and validation. |
| 47 | + |
| 48 | +### Construction of this dataset |
| 49 | + |
| 50 | +For each industry-country-year cell, `ai_adoption_share` is computed as the mean of a binary AI adoption indicator across all firms in the cell. The number of unique firms (`n_firms`) is reported alongside each share. |
| 51 | + |
| 52 | +### Minimum firm threshold |
| 53 | + |
| 54 | +Cells with fewer than 30 firms have `ai_adoption_share` set to null to avoid reporting unreliable shares based on small samples. The `n_firms` count is always reported regardless of the threshold. |
| 55 | + |
| 56 | +### Code validation |
| 57 | + |
| 58 | +Industry codes are validated against official classification files (Eurostat NACE Rev. 2 RDF and 2022 NAICS Structure) before aggregation. Higher-level codes (1-digit through 3-digit) are derived from valid 4-digit codes. |
| 59 | + |
| 60 | +## Citation |
| 61 | + |
| 62 | +If you use this data, please cite: |
| 63 | + |
| 64 | +```bibtex |
| 65 | +@article{garbers_mapping_ai, |
| 66 | + title = {Mapping Artificial Intelligence: Evidence from Firm-Level Web Data in Europe}, |
| 67 | + author = {Garbers, Julio and Gregory, Terry}, |
| 68 | + year = {2025} |
| 69 | +} |
| 70 | +``` |
| 71 | + |
| 72 | +## License |
| 73 | + |
| 74 | +This dataset is licensed under the [Creative Commons Attribution 4.0 International (CC BY 4.0)](https://creativecommons.org/licenses/by/4.0/) license. You are free to share and adapt this data for any purpose, provided you give appropriate credit by citing the paper above. |
0 commit comments