Incremental Knowledge Base Construction Using DeepDive

02/03/2015
∙
by   Jaeho Shin, et al.
∙
0
∙

Populating a database with unstructured information is a long-standing problem in industry and research that encompasses problems of extraction, cleaning, and integration. Recent names used for this problem include dealing with dark data and knowledge base construction (KBC). In this work, we describe DeepDive, a system that combines database and machine learning ideas to help develop KBC systems, and we present techniques to make the KBC process more efficient. We observe that the KBC process is iterative, and we develop techniques to incrementally produce inference results for KBC systems. We propose two methods for incremental inference, based respectively on sampling and variational techniques. We also study the tradeoff space of these methods and develop a simple rule-based optimizer. DeepDive includes all of these contributions, and we evaluate DeepDive on five KBC systems, showing that it can speed up KBC inference tasks by up to two orders of magnitude with negligible impact on quality.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
∙ 09/19/2018

Towards the Development of a Rule-based Drought Early Warning Expert Systems using Indigenous Knowledge

Drought forecasting and prediction is a complicated process due to the c...
research
∙ 09/10/2017

AppTechMiner: Mining Applications and Techniques from Scientific Articles

This paper presents AppTechMiner, a rule-based information extraction fr...
research
∙ 08/01/2011

Scaling Inference for Markov Logic with a Task-Decomposition Approach

Motivated by applications in large-scale knowledge base construction, we...
research
∙ 07/11/2011

Rule-based query answering method for a knowledge base of economic crimes

We present a description of the PhD thesis which aims to propose a rule-...
research
∙ 11/15/2018

Combining Axiom Injection and Knowledge Base Completion for Efficient Natural Language Inference

In logic-based approaches to reasoning tasks such as Recognizing Textual...
research
∙ 03/13/2013

Integrating Model Construction and Evaluation

To date, most probabilistic reasoning systems have relied on a fixed bel...
research
∙ 06/30/2023

Multi-Dialectal Representation Learning of Sinitic Phonology

Machine learning techniques have shown their competence for representing...

Please sign up or login with your details

Forgot password? Click here to reset