You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Digitising is the process of tracing features from a georeferenced map and converting them into digital points, lines, or— in our case—polygons that a GIS can interpret. In this tutorial, we will digitise the language areas shown on the Alor-Pantar map by Schapper (2020), which we georeferenced in the [Georeferencing tutorial](../georeferencing/index.md). This tutorial focuses on copying the geometry of the language polygons from the map. For information on how to copy attributes (including Glottocodes) and metadata, see the [Attributes and Metadata](../metadata/index.md) and the [Glottocodes tutorial](../glottocodes/index.md).
@@ -24,14 +14,14 @@ This method involves splitting language areas from an existing polygon dataset o
24
14
Finally, we [merge the digitised individual polygons into Multipolygons](#merging-polygons-by-language-name) grouped by language name. Some languages are represented by several disjoint polygons, which we combine into a single Multipolygon geometry for easier handling.
25
15
26
16
27
-
## Requirements
17
+
###Requirements
28
18
**Software**: [QGIS](https://qgis.org) is a free and open-source geographic information system (GIS). This tutorial uses version **QGIS 3.34.4-Prizren**.
29
19
30
20
**Data**: A georeferenced map in GeoTIFF format. In this tutorial, we digitise the Alor-Pantar languages map from the [Georeferencing tutorial](../georeferencing/index.md), whose GeoTIFF can be downloaded [here](../georeferencing/out/Map%203%20The%20Alor-Pantar%20languages.tiff). For part B, we also use **Land polygons, including major islands**, from the [1:10m Physical Vectors dataset by Natural Earth](https://www.naturalearthdata.com/downloads/10m-physical-vectors/). These data are provided as Shapefiles, a widely used legacy format for storing geographic vector data.
31
21
32
22
33
23
34
-
## Digitising language polygons from scratch
24
+
###Digitising language polygons from scratch
35
25
36
26
Before digitising, we need to load the **georeferenced raster map** of the Alor Pantar languages we created in the [Georeferencing tutorial](../georeferencing/index.md). Go to **Layer** > **Add Layer** > **Add Raster Layer...** and locate the file, or drag and drop the file into the **Layers** panel.
37
27
@@ -68,7 +58,7 @@ In the dialog box, click the `...` button next to **Database** to choose a locat
68
58
69
59
For a detailed explanation of all attributes needed for Glottography polygons, see the [Attributes and Metadata tutorial](../metadata/index.md). In the **Advanced** section, change the name of the **Feature id column** to `id`. Once all fields are defined, click **OK** to create the GeoPackage layer.
70
60
71
-
### Start Digitising
61
+
####Start Digitising
72
62
73
63
Ensure that your new GeoPackage layer is selected in the **Layers** panel. Enable **Edit Mode** by clicking the **pencil icon** in the toolbar, or right-click the layer and select **Toggle Editing**.
74
64
@@ -127,7 +117,7 @@ Don’t forget to save your QGIS project regularly by going to **Project** > **S
127
117
When you have finished digitising, head to section [Merging polygons by language name](#merging-polygons-by-language-name) to combine the digitised individual polygons into Multipolygons grouped by language name.
128
118
129
119
130
-
### Snapping
120
+
####Snapping
131
121
132
122
To improve the accuracy of your digitising when adding polygons, you can enable **snapping** so that new features align precisely with existing ones. First, make sure the **Snapping Toolbar** is visible. Go to **View** > **Toolbars** > **Snapping Toolbar**. In the **Snapping Toolbar**, click the **magnet icon** to enable snapping.
133
123
@@ -150,7 +140,7 @@ Set the **snapping distance**. Typically 10 pixels is a good starting point. Nex
150
140
Digitising now works as before, but with one key difference: QGIS will automatically snap new vertices to nearby existing ones when in snapping distance, helping you maintain clean, topologically correct boundaries.
151
141
152
142
153
-
### Cutting enclave language polygons
143
+
####Cutting enclave language polygons
154
144
155
145
In some cases, a language area may be completely surrounded by another. An example is _Pennsylvania Dutch_ in the USA, entirely enclosed by English-speaking regions. For such a language enclave, we first cut a hole in the surrounding language area and then fill that hole with a new polygon representing the enclave language. This approach is suited for very specific cases and may not be commonly needed. However, in situations where an isolated language enclave exists, it is often the only viable method. While there is no such enclave on the Alor-Pantar language map, we will briefly walk through the steps required to create one. Click the **Toggle Editing** icon to start editing. Ensure the **Advanced Digitising Toolbar** is active. If not, go to **View** > **Toolbars** > **Advanced Digitising Toolbar**. In the **Advanced Digitising Toolbar**, click the **Fill Ring** icon.
156
146
@@ -180,7 +170,7 @@ You can now select the filled ring to verify that the tool created a new polygon
180
170
181
171
182
172
183
-
## Splitting language polygons from existing landforms
173
+
###Splitting language polygons from existing landforms
184
174
185
175
This method splits language areas from an existing polygon dataset of continents and landforms. We use the **Land polygons including major islands** shapefile from the [1:10m Physical Vectors by Natural Earth](https://www.naturalearthdata.com/downloads/10m-physical-vectors/). To load the shapefile in QGIS, go to **Layer** > **Add Layer** > **Add Vector Layer...**, browse to the file location of the Natural Earth land polygon shapefile, and click **Add**. Alternatively, you can simply drag and drop the shapefile into the **Layers** panel in QGIS.
186
176
@@ -193,7 +183,7 @@ This method splits language areas from an existing polygon dataset of continents
193
183
194
184
We can already see that the Natural Earth land polygons are not detailed enough for this region — they are missing the Pura and Treweng islands. While we likely wouldn’t use this dataset for digitising this language map, we will carry on for the sake of demonstrating how to cut language polygons.
195
185
196
-
### Preparing the Base Layer for Digitising
186
+
####Preparing the Base Layer for Digitising
197
187
198
188
The Natural Earth land polygons will serve as the base from which we cut out the digitised language areas. We need to prepare this vector layer first. Shapefiles are a clunky, legacy format that store geometry and attribute data across separate files. To streamline our workflow, we convert the layer to a **GeoPackage**. Right-click the layer in the **Layers** panel and go to **Export** > **Save Features As...**.
199
189
@@ -213,7 +203,7 @@ A dialog appears. Set the **Format** to `GeoPackage`, and specify the **File nam
213
203
214
204
215
205
216
-
### Multipolygons to single parts
206
+
####Multipolygons to single parts
217
207
218
208
The Natural Earth polygons are stored as large **Multipolygons**, each containing potentially hundreds of single polygons. Before digitising, we need to separate these into their individual components using the **Multipart to Singleparts** tool. Go to **Processing Toolbox** > **Vector geometry** > **Multipart to Singleparts...**.
219
209
@@ -235,7 +225,7 @@ In the dialog, set the **Input layer** to the Natural Earth land polygons and de
235
225
236
226
Click **Run**. This will generate a new layer where each polygon becomes a separate feature.
237
227
238
-
### Cropping to the Language Map Region
228
+
####Cropping to the Language Map Region
239
229
240
230
Next, we crop the single-part land polygons to the region covered by our language map. Click the **Select Features by Polygon** icon in the toolbar.
241
231
@@ -277,7 +267,7 @@ In the dialog, save the selected features as a **GeoPackage**. Use the name of t
277
267
We have now cropped the Natural Earth polygons to only those overlapping with the language map.
278
268
279
269
280
-
### Editing the Attribute Table
270
+
####Editing the Attribute Table
281
271
282
272
Next, we prepare the attribute table by removing irrelevant fields and adding those required by Glottography. Right-click the cropped layer and select **Open Attribute Table**.
283
273
@@ -345,7 +335,7 @@ Repeat the process to add the remaining fields `name`, `full_map_name`, `year`,
345
335
Your layer is now ready for splitting off language areas.
346
336
347
337
348
-
### Splitting Language Polygons
338
+
####Splitting Language Polygons
349
339
350
340
To better see the map beneath, adjust the visual appearance of the cropped layer. Right-click the layer and select **Properties**.
351
341
@@ -395,7 +385,7 @@ To trace the Kiraman language area, start in the ocean southwest of the landmass
395
385
If you receive the error message: **"No features were split: If there are selected features, the split tool only applies to those..."**, this usually means the wrong feature was selected. Click **Deselect Features from All Layers** to fix the issue.
396
386
397
387
398
-
### Adding Attribute Information
388
+
####Adding Attribute Information
399
389
400
390
After tracing a language polygon, fill in the attribute fields. Use the **Identify Features** tool and ensure only the new polygon is highlighted.
401
391
@@ -426,7 +416,7 @@ A form will open where you can enter the relevant attribute data for the Kiraman
426
416
427
417
Click **OK** when done. Repeat the process until all language polygons are split from the Natural Earth land polygons.
428
418
429
-
## Merging polygons by language name
419
+
###Merging polygons by language name
430
420
431
421
Some languages are represented by several disjoint polygons. For example, the Wersing language occurs in Multiple polygons in the northeast, east, and southeast of Alor Island. We merge all digitised individual polygons into a single geometry — a Multipolygon — based on shared language name. Note, however, that this approach assumes the name uniquely identifies a language. If different languages share the same name, the merge into Multipolygons must instead be based on another identifier or attribute.
432
422
@@ -457,16 +447,6 @@ We can verify that Wersing, for example, is represented by a single geometry. N
457
447
<figcaption><em>The individual polygons of the Wersing language merged into a single Multipolygon.</em></figcaption>
458
448
</figure>
459
449
460
-
## Output
461
-
462
-
A GeoPackage file containing language polygons and attributes (see [Attributes and metadata](../metadata/index.md) and [Glottocodes tutorial](../glottocodes/index.md)). The Alor–Pantar language polygons, digitised in this tutorial, can be downloaded [here](out/schapper2020papuan.gpkg).
A GeoPackage file containing language polygons and attributes (see [Attributes and metadata](../metadata/index.md) and [Glottocodes tutorial](../glottocodes/index.md)). The Alor–Pantar language polygons, digitised in this tutorial, can be downloaded [here](out/schapper2020papuan.gpkg).
0 commit comments