Mining Rules Incrementally over Large Knowledge Bases

04/20/2019
by Xiaofeng Zhou, et al.

Multiple web-scale knowledge bases, e.g., Freebase, YAGO, and NELL, have been constructed using semi-supervised or unsupervised information extraction techniques, and many of them, despite their large sizes, are continuously growing. Much research effort has been put into mining inference rules from knowledge bases. To address the task of rule mining over evolving web-scale knowledge bases, we propose a parallel incremental rule mining framework. Our approach efficiently mines rules based on the relational model and applies updates to large knowledge bases. We propose an alternative metric that reduces computational complexity without compromising quality, and we apply multiple optimization techniques that reduce runtime by more than two orders of magnitude. Experiments show that our approach scales efficiently to web-scale knowledge bases and saves over 90% of the time compared to the state-of-the-art batch rule mining system. We also apply our optimization techniques to the batch rule mining algorithm, reducing runtime by more than half compared to the state-of-the-art. To the best of our knowledge, our incremental rule mining system is the first that handles updates to web-scale knowledge bases.
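To make the idea of incremental rule mining concrete, the following is a minimal sketch, not the authors' system. It assumes a single chain rule of the form p(x, y) AND q(y, z) => r(x, z), insert-only updates, and the standard support and confidence measures rather than the alternative metric the abstract mentions. The class name `ChainRuleSupport`, the helper `batch_support`, and the example relations (bornIn, cityIn, nationality) are all hypothetical. The point is only that a new fact can be joined against the existing relations instead of re-evaluating the whole rule.

```python
from collections import defaultdict


class ChainRuleSupport:
    """Tracks support of p(x, y) AND q(y, z) => r(x, z) under insert-only updates.

    Hypothetical illustration: each insert joins only the new fact against
    the stored relations, so nothing is recomputed from scratch.
    """

    def __init__(self):
        self.p_by_y = defaultdict(set)  # y -> {x : p(x, y)}
        self.q_by_y = defaultdict(set)  # y -> {z : q(y, z)}
        self.r = set()                  # {(x, z) : r(x, z)}
        self.body = set()               # (x, z) pairs that satisfy the rule body
        self.supported = set()          # body pairs that also appear in r

    def _add_body(self, x, z):
        self.body.add((x, z))
        if (x, z) in self.r:
            self.supported.add((x, z))

    def insert_p(self, x, y):
        if x in self.p_by_y[y]:
            return  # duplicate fact, nothing changes
        self.p_by_y[y].add(x)
        for z in self.q_by_y[y]:  # delta join: new p fact with existing q facts
            self._add_body(x, z)

    def insert_q(self, y, z):
        if z in self.q_by_y[y]:
            return
        self.q_by_y[y].add(z)
        for x in self.p_by_y[y]:  # delta join: new q fact with existing p facts
            self._add_body(x, z)

    def insert_r(self, x, z):
        self.r.add((x, z))
        if (x, z) in self.body:
            self.supported.add((x, z))

    def support(self):
        return len(self.supported)

    def confidence(self):
        # Standard confidence: supported pairs over all body pairs.
        return len(self.supported) / len(self.body) if self.body else 0.0


def batch_support(p, q, r):
    """Recompute support from scratch, for comparison with the incremental path."""
    q_by_y = defaultdict(set)
    for y, z in q:
        q_by_y[y].add(z)
    body = {(x, z) for x, y in p for z in q_by_y.get(y, ())}
    return len(body & set(r))


# Hypothetical rule: bornIn(x, y) AND cityIn(y, z) => nationality(x, z)
miner = ChainRuleSupport()
miner.insert_p("ann", "paris")
miner.insert_q("paris", "france")
miner.insert_r("ann", "france")
miner.insert_p("bob", "paris")  # body grows, head fact for bob is still missing
print(miner.support(), miner.confidence())
```

Under these assumptions the incremental counters agree with a full recomputation over the same facts. Handling deletions, multiple rules, or parallel execution would need more bookkeeping than this sketch shows.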


