Enriching Wikidata with Linked Open Data

07/01/2022
by   Bohui Zhang, et al.
0

Large public knowledge graphs, like Wikidata, contain billions of statements about tens of millions of entities, thus inspiring various use cases to exploit such knowledge graphs. However, practice shows that much of the relevant information that fits users' needs is still missing in Wikidata, while current linked open data (LOD) tools are not suitable to enrich large graphs like Wikidata. In this paper, we investigate the potential of enriching Wikidata with structured data sources from the LOD cloud. We present a novel workflow that includes gap detection, source selection, schema alignment, and semantic validation. We evaluate our enrichment method with two complementary LOD sources: a noisy source with broad coverage, DBpedia, and a manually curated source with narrow focus on the art domain, Getty. Our experiments show that our workflow can enrich Wikidata with millions of novel statements from external LOD sources with a high quality. Property alignment and data quality are key challenges, whereas entity alignment and source selection are well-supported by existing Wikidata mechanisms. We make our code and data available to support future work.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/28/2021

Towards Neural Schema Alignment for OpenStreetMap and Knowledge Graphs

OpenStreetMap (OSM) is one of the richest openly available sources of vo...
research
02/23/2020

Path Outlines: Browsing Path-Based Summaries of Linked Open Datasets

Linked Data (LD) are structured sources of information, such as DBpedia ...
research
12/08/2019

Data Exploration and Validation on dense knowledge graphs for biomedical research

Here we present a holistic approach for data exploration on dense knowle...
research
03/07/2016

TruthDiscover: Resolving Object Conflicts on Massive Linked Data

Considerable effort has been made to increase the scale of Linked Data. ...
research
04/17/2020

Duplication Detection in Knowledge Graphs: Literature and Tools

In recent years, an increasing amount of knowledge graphs (KGs) have bee...
research
07/01/2021

A Study of the Quality of Wikidata

Wikidata has been increasingly adopted by many communities for a wide va...
research
09/30/2017

Towards Understanding the Evolution of Vocabulary Terms in Knowledge Graphs

Vocabularies are used for modeling data in Knowledge Graphs (KG) like th...

Please sign up or login with your details

Forgot password? Click here to reset