Challenges of Linking Organizational Information in Open Government Data to Knowledge Graphs

08/14/2020
by   Jan Portisch, et al.
0

Open Government Data (OGD) is being published by various public administration organizations around the globe. Within the metadata of OGD data catalogs, the publishing organizations (1) are not uniquely and unambiguously identifiable and, even worse, (2) change over time, by public administration units being merged or restructured. In order to enable fine-grained analyses or searches on Open Government Data on the level of publishing organizations, linking those from OGD portals to publicly available knowledge graphs (KGs) such as Wikidata and DBpedia seems like an obvious solution. Still, as we show in this position paper, organization linking faces significant challenges, both in terms of available (portal) metadata and KGs in terms of data quality and completeness. We herein specifically highlight five main challenges, namely regarding (1) temporal changes in organizations and in the portal metadata, (2) lack of a base ontology for describing organizational structures and changes in public knowledge graphs, (3) metadata and KG data quality, (4) multilinguality, and (5) disambiguating public sector organizations. Based on available OGD portal metadata from the Open Data Portal Watch, we provide an in-depth analysis of these issues, make suggestions for concrete starting points on how to tackle them along with a call to the community to jointly work on these open challenges.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/17/2021

Open Data and the Status Quo – A Fine-Grained Evaluation Framework for Open Data Quality and an Analysis of Open Data portals in Germany

This paper presents a framework for assessing data and metadata quality ...
research
05/23/2017

Calidad en repositorios digitales en Argentina, estudio comparativo y cualitativo

Numerous institutions and organizations need not only to preserve the ma...
research
11/04/2019

Spatial Search Strategies for Open Government Data: A Systematic Comparison

The increasing availability of open government datasets on the Web calls...
research
08/13/2023

Modeling the Dashboard Provenance

Organizations of all kinds, whether public or private, profit-driven or ...
research
08/04/2017

Exploiting Redundancy, Recurrence and Parallelism: How to Link Millions of Addresses with Ten Lines of Code in Ten Minutes

Accurate and efficient record linkage is an open challenge of particular...
research
09/30/2017

Towards Understanding the Evolution of Vocabulary Terms in Knowledge Graphs

Vocabularies are used for modeling data in Knowledge Graphs (KG) like th...
research
07/30/2019

Classi-Fly: Inferring Aircraft Categories from Open Data

In recent years, air traffic communication data has become easy to acces...

Please sign up or login with your details

Forgot password? Click here to reset