Knowledge Graph for Microdata of Statistics Netherlands

01/19/2021
by   Chang Sun, et al.
0

Statistics Netherlands (CBS) hosted a huge amount of data not only on the statistical level but also on the individual level. With the development of data science technologies, more and more researchers request to conduct their research by using high-quality individual data from CBS (called CBS Microdata) or combining them with other data sources. Making great use of these data for research and scientific purposes can tremendously benefit the whole society. However, CBS Microdata has been collected and maintained in different ways by different departments in and out of CBS. The representation, quality, metadata of datasets are not sufficiently harmonized. The project converts the descriptions of all CBS microdata sets into one knowledge graph with comprehensive metadata in Dutch and English using text mining and semantic web technologies. Researchers can easily query the metadata, explore the relations among multiple datasets, and find the needed variables. For example, if a researcher searches a dataset about "Age at Death" in the Health and Well-being category, all information related to this dataset will appear including keywords and variable names. "Age at Death" dataset has a keyword - "Death". This keyword will lead to other datasets such as "Date of Death". "Cause of Death", "Production statistics Health and welfare" from Population, Business categories, and Health and well-being categories. This will tremendously save time and costs for the data requester but also data maintainers.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/26/2019

The FAIR Funder pilot programme to make it easy for funders to require and for grantees to produce FAIR Data

There is a growing acknowledgement in the scientific community of the im...
research
12/11/2018

Text data mining and data quality management for research information systems in the context of open data and open science

In the implementation and use of research information systems (RIS) in s...
research
08/07/2022

Data Leaves: Scenario-oriented Metadata for Data Federative Innovation

A method for representing the digest information of each dataset is prop...
research
04/09/2018

Recommendation System of Grants-in-Aid for Researchers by using JSPS Keyword

An acquisition of a research grant is important for the researchers to c...
research
03/15/2020

On new data sources for the production of official statistics

In the past years we have witnessed the rise of new data sources for the...
research
02/11/2020

Two Huge Title and Keyword Generation Corpora of Research Articles

Recent developments in sequence-to-sequence learning with neural network...
research
05/13/2022

An Approach for Automatic Construction of an Algorithmic Knowledge Graph from Textual Resources

There is enormous growth in various fields of research. This development...

Please sign up or login with your details

Forgot password? Click here to reset