Knowledge Completion for Generics using Guided Tensor Factorization

12/12/2016
by   Hanie Sedghi, et al.
0

Given a knowledge base (KB) rich in facts about common nouns or generics, such as "all trees produce oxygen" or "some animals live in forests", we consider the problem of deriving additional such facts at a high precision. While this problem has received much attention for named entity KBs such as Freebase, little emphasis has been placed on generics despite their importance for capturing general knowledge. Different from named entity KBs, generics KBs involve implicit or explicit quantification, have more complex underlying regularities, are substantially more incomplete, and violate the commonly used locally closed world assumption (LCWA). Consequently, existing completion methods struggle with this new task. We observe that external information, such as relation schemas and entity taxonomies, if used correctly, can be surprisingly powerful in addressing the challenges associated with generics. Using this insight, we propose a simple yet effective knowledge guided tensor factorization approach that achieves state-of-the-art results on two generics KBs for science, doubling their size at 74%-86% precision. Further, to address the paucity of facts about rare entities such as oriole (a bird), we present a novel taxonomy guided submodular active learning method to collect additional annotations that are over five times more effective in inferring further new facts than multiple active learning baselines.

READ FULL TEXT
research
03/15/2017

Sparse Named Entity Classification using Factorization Machines

Named entity classification is the task of classifying text-based elemen...
research
03/18/2023

Exploring Partial Knowledge Base Inference in Biomedical Entity Linking

Biomedical entity linking (EL) consists of named entity recognition (NER...
research
09/03/2019

Modeling Named Entity Embedding Distribution into Hypersphere

This work models named entity distribution from a way of visualizing top...
research
04/21/2019

Fact Discovery from Knowledge Base via Facet Decomposition

During the past few decades, knowledge bases (KBs) have experienced rapi...
research
10/20/2020

Bootleg: Chasing the Tail with Self-Supervised Named Entity Disambiguation

A challenge for named entity disambiguation (NED), the task of mapping t...
research
10/15/2021

Cross-Domain Data Integration for Named Entity Disambiguation in Biomedical Text

Named entity disambiguation (NED), which involves mapping textual mentio...

Please sign up or login with your details

Forgot password? Click here to reset