You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: digitising/index.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,4 +1,4 @@
1
-
# Digitising language polygons in QGIS
1
+
# Digitising language polygons
2
2
3
3
Digitising is the process of tracing features from a georeferenced map and converting them into digital points, lines, or— in our case—polygons that a GIS can interpret. In this tutorial, we will digitise the language areas shown on the Alor-Pantar map by Schapper (2020), which we georeferenced in the [Georeferencing tutorial](../georeferencing/index.md). This tutorial focuses on copying the geometry of the language polygons from the map. For information on how to copy attributes (including Glottocodes) and metadata, see the [Attributes and Metadata](../metadata/index.md) and the [Glottocodes tutorial](../glottocodes/index.md).
This tutorial introduces the attributes and metadate required when digitising Glottography language areas from source publications. Glottography uses BibTeX entries to uniquely reference each source publication, and Glottocodes to identify the languages depicted in their maps. Because Glottocodes were introduced only relatively recently, many source publications — especially older ones — likely do not include them. As a result, assigning the correct Glottocodes to a language area can be time-consuming and may require additional effort. To assist with this process, a separate [Glottocode tutorial](../glottocodes/index.md) explains how to automatically query and assign Glottocodes to a language area based on language name and geographic location.
14
4
15
5
## Requirements
16
-
**Software**: [QGIS](https://qgis.org) is a free and open-source geographic information system (GIS). This tutorial uses version **QGIS 3.34.4-Prizren**.
6
+
**Software**: [QGIS](https://qgis.org) is a free and open-source geographic information system (GIS). This tutorial uses version QGIS 3.34.4-Prizren.
17
7
18
8
**Data:** Digitised language polygons in GeoPackage format (`.gpkg`). In this tutorial, we use the digitised Alor–Pantar language polygons from the [Digitising tutorial](../digitising/index.md), which can be downloaded [here](../digitising/out/schapper2020papuan.gpkg).
19
9
20
10
21
-
## Metadata
22
-
### Referencing the source publication
11
+
###Metadata
12
+
####Referencing the source publication
23
13
24
14
A source publication is any piece of work that includes a language map we wish to incorporate into Glottography. For example, the map of the Alor and Pantar languages that we [georeferenced](../georeferencing/index.md) and [digitised](../digitising/index.md) earlier was taken from the book chapter *Introduction to The Papuan Languages of Timor, Alor and Pantar* (https://doi.org/10.1515/9781501511158-001) by Schapper (2020). Glottography records source publications using the BibTeX format. BibTeX stores metadata on bibliography items, such as articles, books, and theses, in plain text files, typically with the `.bib` file extension. A single BibTeX file can contain multiple entries, but for our purposes, each file must contain exactly one. The publication by Schapper (2020) has the following BibTeX entry:
25
15
@@ -44,7 +34,7 @@ A source publication is any piece of work that includes a language map we wish t
44
34
The BibTeX entry consists of the entry type (e.g., `@inbook`), a citation key (e.g., `schapper2020papuan`), and several fields or tags that define the details of the publication, such as `author`, `title`, and `url`. The type `@inbook` indicates that the source is a chapter in a book. Similarly, `@article` is used for journal articles, and `@book` for standalone books. Glottography requires the citation key to follow the format `authorYYYYtopic`, and the corresponding BibTeX file must be named `authorYYYYtopic.bib`, for example, `schapper2020papuan.bib`. The Glottography import script in the [data curation tutorial](../curation/index.md) depends on both the citation key and the file name being in this exact format to function correctly.
45
35
46
36
47
-
### How to obtain a BibTeX Entry
37
+
####How to obtain a BibTeX Entry
48
38
There are four main ways to obtain a BibTeX entry for a source publication, which we will briefly cover here.
49
39
50
40
The easiest method is likely to check Glottolog. By visiting the [Glottolog website](https://glottolog.org/), you can use the "Reference Search" feature to enter bibliographic information such as the author’s name and publication year. Once you retrieve the relevant entry, you can open it, copy the BibTeX citation, and paste it into a plain text editor like Notepad++. You'll then need to adjust the title and citation key to follow the required `authorYYYYtopic` format and save the file under the same name with a `.bib` extension.
@@ -56,8 +46,8 @@ A third method is to use [Google Scholar](https://scholar.google.com/). You can
56
46
Finally, if you are familiar with BibTeX syntax, you can simply write the entry by hand. Regardless of which method you choose, the end result should be a `.bib` file containing a single entry. The file name must follow the `authorYYYYtopic.bib` format, and the citation key inside the file must match the file name.
57
47
58
48
59
-
## Attributes
60
-
### Collecting attribute data for the language areas
49
+
###Attributes
50
+
####Collecting attribute data for the language areas
61
51
62
52
When [digitising language areas](../digitising/index.md), **Glottography** requires attribute data describing the languages each area represents. This attribute data is usually recorded during digitisation but can also be added later. The following attributes must be included:
63
53
@@ -88,7 +78,7 @@ The table below shows the attributes for selected language areas from the Alor-P
88
78
89
79
90
80
91
-
### Exporting the attribute data as CSV
81
+
####Exporting the attribute data as CSV
92
82
93
83
Glottography expects attribute data to be exported to a separate CSV file, where each row corresponds to a digitised language area and is linked to the polygon geometry via a unique identifier (`id`). Keeping metadata in plain-text CSV format makes it easy to update and manage. It is both human-readable and compatible with version control tools like Git. The GeoPackage file can easily be exported as a CSV in QGIS. The following steps assume that QGIS is open and that all language areas and their polygons have been recorded and saved in a GeoPackage, following the [Digitising Tutorial](../digitising/index.md).
94
84
@@ -114,19 +104,9 @@ In the dialog that appears configure the export settings. Set **Format** to `Com
114
104
115
105
116
106
117
-
## Output
107
+
###Output
118
108
A BibTeX file containing a reference to the source publication in BibTeX format. The BibTeX file for the Alor–Pantar language map can be downloaded [here](out/schapper2020papuan.bib).
119
109
120
110
A GeoPackage file containing the language polygons (see the [Digitising tutorial](../digitising/index.md)) and attributes (see also the [Glottocodes tutorial](../glottocodes/index.md)). The Alor–Pantar language polygons including atttribute data can be downloaded [here](../digitising/out/schapper2020papuan.gpkg).
121
111
122
112
A CSV file containing the attribute data, linked to the digitised polygons via the `id` column. The CSV file for the Alor–Pantar language polygons can be downloaded [here](out/schapper2020papuan.csv).
0 commit comments