forked from openlandmap/SoilSamples
-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy pathindex.Rmd
More file actions
267 lines (215 loc) · 15.8 KB
/
index.Rmd
File metadata and controls
267 lines (215 loc) · 15.8 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
---
title: "An Open Compendium of Soil Datasets"
author: "Tomislav Hengl (OpenGeoHub)"
date: "`r format(Sys.time(), '%d %B, %Y')`"
knit: bookdown::render_book
site: bookdown::bookdown_site
documentclass: svmono
classoption: graybox,natbib,nospthms
geometry: "paperwidth=18.90cm, paperheight=24.58cm, top=2.1cm, bottom=2.1cm, inner=2cm, outer=2cm"
monofont: "Source Code Pro"
monofontoptions: "Scale=0.7"
bibliography: refs.bib
biblio-style: spbasic
link-citations: yes
twitter-handle: tom_hengl
cover-image: img/sol_chem.pnts_sites.png
description: "This is a public compendium of global, regional, national and sub-national soil samples and/or soil profile datasets (points with Observations and Measurements of soil properties and characteristics). Datasets listed here, assuming compatible open license, are afterwards imported into a common library."
github-repo: OpenGeoHub/SoilSamples
url: 'http\://github.com/OpenGeoHub/SoilSamples'
colorlinks: yes
graphics: yes
---
# About
## Rationale
This is a public compendium of global, regional, national and
sub-national **soil samples** and/or **soil profile** datasets (points with
Observations and Measurements of soil properties and characteristics).
Datasets listed here, assuming [compatible open data license](https://opendefinition.org/licenses/), are afterwards
imported into the [**Global compilation of soil chemical and physical
properties and soil classes**](https://gitlab.com/openlandmap/) and
eventually used to create a better open soil information across countries.
The specific objectives of this initiative are:
- To enable data digitization, import and binding + harmonization,\
- To accelerate research collaboration and networking,\
- To enable development of more accurate / more usable global and
regional soil property and class maps (typically published via
<https://OpenLandMap.org>),
The minimum requirements to submit a dataset for inclusion to [the
OpenLandMap repository](https://gitlab.com/openlandmap/) are:
- License and terms of use clearly specified AND,\
- Complete and consistent metadata that can ensure correct
standardization and harmonization steps AND,\
- At least 50 unique spatial locations AND,\
- No broken or invalid URLs,
Datasets that do NOT satisfy the above listed minimum requirements might be
removed. If you discover an issue with license, data description or
version number of a dataset, please open a [Github
issue](https://github.com/OpenGeoHub/SoilSamples/issues).
Recommended settings for all datasets are:
- Peer-reviewed versions of the datasets (i.e. a dataset accompanied
with a peer-reviewed publication) should have the priority,\
- Register your dataset (use e.g. <https://zenodo.org>) and assign a DOI
to each version,\
- Provide enough metadata so that it can be imported and bind with
other data without errors,\
- If your dataset is a compilation of previously published datasets, please
indicate in the description,
Information outdated or missing? Please open an issue or best do a
correction and then a [pull
request](https://docs.github.com/en/github/collaborating-with-issues-and-pull-requests/creating-a-pull-request).
```{r soil-pnts, echo=FALSE, fig.cap="Soil profiles and soil samples with chemical and physical properties global compilation. For more info see: https://gitlab.com/openlandmap/compiled-ess-point-data-sets.", out.width="100%"}
knitr::include_graphics("img/sol_chem.pnts_sites.png")
```
## Existing soil data projects and initiatives
Multiple international organizations from [FAO's Global Soil Partnership](http://www.fao.org/global-soil-partnership/en/) to [UNCCD's Land Degredation Neutrality](https://www.unccd.int/actions/ldn-target-setting-programme), [European Commission](https://esdac.jrc.ec.europa.eu/) and similar,
support soil data collation projects and especially curation of the legacy soil data.
Some existing soil Observations and Measurements (O&M) soil data initiatives include:
- [**FAO's Data Hub**](https://www.fao.org/soils-portal/data-hub/en/),\
- [**Fine Root Ecology Database (FRED)**](https://roots.ornl.gov/),\
- [**FLUXNET global network**](https://fluxnet.fluxdata.org/),\
- [**Global database of soil nematodes**](https://www.nature.com/articles/s41597-020-0437-3),\
- [**Global soil macrofauna database**](http://macrofauna.earthworms.info/),\
- [**Global soil respiration database (SRDB)**](https://github.com/bpbond/srdb),\
- [**International Soil Modeling Consortium (ISMC)**](https://soil-modeling.org),\
- [**International Soil Moisture Network**](https://ismn.geo.tuwien.ac.at/en/),\
- [**International Soil Radiocarbon Database (ISRaD)**](https://soilradiocarbon.org),\
- [**International Soil Carbon Network (ISCN)**](http://iscn.fluxdata.org/),\
- [**LandPKS project**](http://portal.landpotential.org/#/landpksmap),\
- [**Long Term Ecological Research (LTER) Network sites**](https://lternet.edu/site/),\
- [**National Ecological Observatory Network (NEON)**](https://www.neonscience.org),\
- [**Open Soil Spectral Library (OSSL)**](https://soilspectroscopy.org),\
- [**Soils Data Harmonization (SoDaH)**](https://lter.github.io/som-website),\
- [**WoSIS Soil Profile Database**](https://www.isric.org/explore/wosis),
A more in-depth inventory of all various national and international soil datasets can be found in:
- Rossiter, D.G.,: [**Compendium of Soil Geographical Databases**](https://www.isric.org/explore/soil-geographic-databases)\
## Target soil variables
Soil variables of interest include:
1. **Chemical soil properties**:
- Soil organic carbon, total carbon, total nitrogen,
- Soil pH, effective Cation Exchange Capacity (eCEC),
- Soil sodicity (presence of a high proportion of sodium ions relative to other cations),
- Macro-nutrients: extractable --- potassium (K), calcium (Ca), sodium
(Na), magnesium (Mg) and similar,
- Micro-nutrients: phosphorus (P), sulfur (S), iron (Fe), zinc (Zn)
and similar,
- Soil pollutants, heavy metals and similar,
- Electrical conductivity,
2. **Physical soil properties**:
- Soil texture and texture fractions: silt, clay and sand, stone content,
- Bulk density, depth to bedrock and similar,
- Hydraulic conductivity, water content --- Field Capacity (FC; the amount of water
held in the soil after it has been fully wetted and free drainage has stopped),
Permanent Wilting Point (PWP; the soil moisture condition at which the plant could
not obtain water and would wilt and die), Plant Available Water Capacity (PAWC;
the amount of water between field capacity and permanent wilting point water holding capacity) and
similar,
- Soil temperature,
3. **Soil biological / biodiversity variables**:
- Soil biomass,
- Soil micro-, meso-, macro- and mega-fauna abundance,
- Soil biodiversity indices,
4. **Soil classification / taxonomy variables**:
- Soil type,
- Soi suitability classes, soil fertility classes,
- Soil texture classes and families,
5. **Soil absorbances / soil spectroscopy variables**:
- Soil absorbance in VIS-NIR and MIR part of spectra,
## Recommended O&M standards
As a general rule of thumb we recommend all contributors to use the following
[general scheme](https://soilspectroscopy.github.io/ossl-manual/) to organize Soil Observations & Measurements with 3–4 main tables
and metadata + legends organized in other tables:
- Soil site information (geographical coordinates, land use / land cover, soil classification etc),
- Soil horizon information (soil observations and measurements specific to soil layers / diagnostic horizons),
- Proximal soil sensing information including soil scans,
For making soil observations and measurements we recommend following the USDA [National Cooperative Soil Survey (NCSS) Soil
Characterization Database](https://ncsslabdatamart.sc.egov.usda.gov/) codes and specification as much as possible. These are explained in detail in the [**Kellogg Soil Survey Laboratory Methods Manual**](https://www.nrcs.usda.gov/Internet/FSE_DOCUMENTS/stelprdb1253872.pdf)
and [**The Field Book for Describing and Sampling Soils**](https://www.nrcs.usda.gov/wps/portal/nrcs/detail/soils/research/guide/).
Likewise, [**FAO Guidelines for soil description**](http://www.fao.org/3/a0541e/a0541e.pdf),
and the FAO's [GSOC measurement, monitoring, reporting and verification (MRV) protocol](http://www.fao.org/documents/card/en/c/cb0509en/) also explain in
detail how to collect soil samples and setup a system for monitoring soil organic carbon.
It is recommended that one should, as much as possible, use the international standards
and references. Some highly recommended protocols and standards include:
- [UUID generator tool](https://cran.r-project.org/package=uuid) to generate unique ID's for unique soil sites, horizons, samples etc (to convert an existing local ID to UUID, best use `openssl::md5(local_id)`),
- [Open Location Codes](https://opensource.google/projects/open-location-code) to generate geographic location codes,
- [OGC standards](https://www.ogc.org/standards/om) to prepare metadata and exchange data across field / computer systems;
- [ISO8601](https://en.wikipedia.org/wiki/ISO_8601) to save time and date information,
- [ISO3166](https://en.wikipedia.org/wiki/ISO_3166-1) for country / administrative codes,
- [GPS](https://www.gps.gov/) and WGS84 longitude and latitude in decimal degrees to save the location information,
- [International DOI foundation](https://en.wikipedia.org/wiki/Digital_object_identifier) to refer to specific dataset and/or publication,
- [USDA soil classification system](https://www.nrcs.usda.gov/wps/portal/nrcs/main/soils/survey/class/) and [World Reference Base](http://www.fao.org/soils-portal/data-hub/soil-classification/world-reference-base/en/) to classify soils,
- [USDA soil texture calculator](https://www.nrcs.usda.gov/wps/portal/nrcs/detail/soils/survey/?cid=nrcs142p2_054167) to determine and share soil texture classes including texture-by-hand,
- [Kellogg Soil Survey Laboratory Methods Manual](https://www.nrcs.usda.gov/Internet/FSE_DOCUMENTS/stelprdb1253872.pdf) for reference physical and chemical soil property determination in laboratory,
- [Soils Laboratory Manual, K-State Edition](https://kstatelibraries.pressbooks.pub/soilslabmanual/),
- [GLOSOLAN Standard Operating Procedures (SOPs)](http://www.fao.org/global-soil-partnership/glosolan/soil-analysis/standard-operating-procedures/en/#c763834),
## Contributing
Please feel free to contribute entries. See [GitHub
repository](https://github.com/OpenGeoHub/SoilSamples) for more detailed
instructions.
## Contributors
If you contribute, add also your name and Twitter, ORCID or blog link
below:
[Tomislav Hengl](https://twitter.com/tom_hengl), [Jonathan Sanderman](https://twitter.com/sandersoil), [Mario Antonio Guevara
Santamaria](https://orcid.org/0000-0002-9788-9947),
This document is based on the <https://www.bigbookofr.com/> repository
by Oscar Baruffa.
## Disclaimer
The data is provided "as is". [OpenGeoHub foundation](https://opengeohub.org/about) and its suppliers and licensors hereby disclaim all warranties of any kind, express or implied, including, without limitation, the warranties of merchantability, fitness for a particular purpose and non-infringement. Neither OpenGeoHub foundation nor its suppliers and licensors, makes any warranty that the Website will be error free or that access thereto will be continuous or uninterrupted. You understand that you download from, or otherwise obtain content or services through, the Website at your own discretion and risk.
## Licence
This website/book is free to use, and is licensed under the [Creative
Commons Attribution 4.0 International (CC BY 4.0)](https://creativecommons.org/licenses/by/4.0/).
## Soil Spectroscopy for Global Good
[**SoilSpec4GG**](https://soilspectroscopy.org/) is a USDA-funded [Food and Agriculture Cyberinformatics
Tools Coordinated Innovation Network NIFA Award #2020-67021-32467](https://nifa.usda.gov/press-release/nifa-invests-over-7-million-big-data-artificial-intelligence-and-other) project. It brings together soil
scientists, spectroscopists, informaticians, data scientists and
software engineers to overcome some of the current bottlenecks
preventing wider and more efficient use of soil spectroscopy. For more info refer
to: <https://soilspectroscopy.org/>
## About OpenGeoHub
**OpenGeoHub foundation** is a not-for-profit research foundation
located in Wageningen, the Netherlands. We specifically promote
publishing and sharing of Open Geographical and Geoscientific Data,
using and developing Open Source Software and encouraging and empowering
under-represented researchers e.g. those from ODA recipient countries
and female researchers. We believe that the key measure of quality of
research in all sciences (and especially in geographical information
sciences) is in transparency and reproducibility of the computer code
used to generate results (read more in: ["Everyone has a right to know
what is happening with the planet"](https://opengeohub.medium.com/)).
## Literature
Some other connected publications and initiatives describing collation
and import of legacy soil observations and measurements that might interest
you:
- Arrouays, D., Leenaars, J. G., Richer-de-Forges, A. C., Adhikari,
K., Ballabio, C., Greve, M., ... & Heuvelink, G. (2017). [**Soil
legacy data rescue via GlobalSoilMap and other international and
national initiatives**](https://doi.org/10.1016/j.grj.2017.06.001).
GeoResJ, 14, 1-19.\
- Beillouin, D., Cardinael, R., Berre, D., Boyer, A., Corbeels, M.,
Fallot, A., ... & Demenois, J. (2021). A global overview of studies
about land management, land‐use change, and climate change effects
on soil organic carbon. Global Change Biology. <https://doi.org/10.1111/gcb.15998>\
- Batjes, N. H., Ribeiro, E., van Oostrum, A., Leenaars, J., Hengl,
T., & de Jesus, J. M. (2017). [**WoSIS: providing standardised soil
profile data for the world**](http://www.earth-syst-sci-data.net/9/1/2017/). Earth System Science Data, 9(1), 1. <https://doi.org/10.5194/essd-9-1-2017>\
- Billings, S. A., Lajtha, K., Malhotra, A., Berhe, A. A., de Graaff, M. A.,
Earl, S., ... & Wieder, W. (2021). [**Soil organic carbon is not just for soil
scientists: measurement recommendations for diverse practitioners**](https://doi.org/10.1002/eap.2290). Ecological Applications, 31(3), e02290. <https://doi.org/10.1002/eap.2290>\
- Brown, G., Demeterio, W. and Samuel-Rosa, A. (2021). [**Towards a more open Soil Science**](https://blog.scielo.org/en/2021/01/08/towards-a-more-open-soil-science/). SciELO in Perspective, 2021 [viewed 17 June 2021].
- Gupta, S., Hengl, T., Lehmann, P., Bonetti, S., & Or, D. (2021). [**SoilKsatDB:
global database of soil saturated hydraulic conductivity measurements for
geoscience applications**](https://doi.org/10.5194/essd-13-1593-2021). Earth System Science Data, 13(4), 1593-1612.
<https://doi.org/10.5194/essd-13-1593-2021>\
- Hengl, T., MacMillan, R.A., (2019). [**Predictive Soil Mapping with
R**](https://soilmapper.org/). OpenGeoHub foundation, Wageningen, the
Netherlands, 370 pages, <https://soilmapper.org>, ISBN:
978-0-359-30635-0.\
- Moorberg, C. J., & Crouse, D. A. (2017). [**An open‐source laboratory manual for introductory, undergraduate soil science courses**](https://doi.org/10.4195/nse2017.06.0013). Natural Sciences Education, 46(1), 1-8.\
- Ramcharan, A., Hengl, T., Beaudette, D., & Wills, S. (2017). [**A soil
bulk density pedotransfer function based on machine learning: A case
study with the NCSS soil characterization
database**](https://doi.org/10.2136/sssaj2016.12.0421). Soil Science
Society of America Journal, 81(6), 1279-1287.
<https://doi.org/10.2136/sssaj2016.12.0421>
- Rossiter, D.G.,: [**Compendium of Soil Geographical
Databases**](https://www.isric.org/explore/soil-geographic-databases).