How Stable is Knowledge Base Knowledge?

11/02/2022
by   Suhas Shrinivasan, et al.
0

Knowledge Bases (KBs) provide structured representation of the real-world in the form of extensive collections of facts about real-world entities, their properties and relationships. They are ubiquitous in large-scale intelligent systems that exploit structured information such as in tasks like structured search, question answering and reasoning, and hence their data quality becomes paramount. The inevitability of change in the real-world, brings us to a central property of KBs – they are highly dynamic in that the information they contain are constantly subject to change. In other words, KBs are unstable. In this paper, we investigate the notion of KB stability, specifically, the problem of KBs changing due to real-world change. Some entity-property-pairs do not undergo change in reality anymore (e.g., Einstein-children or Tesla-founders), while others might well change in the future (e.g., Tesla-board member or Ronaldo-occupation as of 2022). This notion of real-world grounded change is different from other changes that affect the data only, notably correction and delayed insertion, which have received attention in data cleaning, vandalism detection, and completeness estimation already. To analyze KB stability, we proceed in three steps. (1) We present heuristics to delineate changes due to world evolution from delayed completions and corrections, and use these to study the real-world evolution behaviour of diverse Wikidata domains, finding a high skew in terms of properties. (2) We evaluate heuristics to identify entities and properties likely to not change due to real-world change, and filter inherently stable entities and properties. (3) We evaluate the possibility of predicting stability post-hoc, specifically predicting change in a property of an entity, finding that this is possible with up to 83 F1 score, on a balanced binary stability prediction task.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/08/2022

Improving Entity Disambiguation by Reasoning over a Knowledge Base

Recent work in entity disambiguation (ED) has typically neglected struct...
research
03/07/2020

Knowledge Graphs and Knowledge Networks: The Story in Brief

Knowledge Graphs (KGs) represent real-world noisy raw information in a s...
research
04/30/2018

Demand-Weighted Completeness Prediction for a Knowledge Base

In this paper we introduce the notion of Demand-Weighted Completeness, a...
research
09/20/2017

Doctoral Advisor or Medical Condition: Towards Entity-specific Rankings of Knowledge Base Properties [Extended Version]

In knowledge bases such as Wikidata, it is possible to assert a large se...
research
10/28/2022

System Network Analytics: Evolution and Stable Rules of a State Series

System Evolution Analytics on a system that evolves is a challenge becau...
research
02/03/2017

Named Entity Evolution Analysis on Wikipedia

Accessing Web archives raises a number of issues caused by their tempora...
research
11/30/2018

Completeness and Consistency Analysis for Evolving Knowledge Bases

Assessing the quality of an evolving knowledge base is a challenging tas...

Please sign up or login with your details

Forgot password? Click here to reset