STUNDA = Sveriges Tekniska Universitets Nätverk för Datatermer
A dump of the data is stored in two formats,
stunda-terms.jsonlstunda-terms.tsv
The part-of-speech (POS) information is in most cases guessed, marked N?
The file was created with the script described below.
rawdata: data copied from heterogeneous sources with no common format required. The starting point of the project.scripts: scripts analysing and converting rawdata
See instructions at the beginning of scripts/convert_data.py