Integrating Local Context and Global Cohesiveness for Open Information Extraction

04/26/2018
by   Qi Zhu, et al.
0

Extracting entities and their relations from text is an important task for understanding massive text corpora. Open information extraction (IE) systems mine relation tuples (i.e., entity arguments and a predicate string to describe their relation) from sentences, and do not confine to a pre-defined schema for the relations of interests. However, current open IE systems focus on modeling local context information in a sentence to extract relation tuples, while ignoring the fact that global statistics in a large corpus can be collectively leveraged to identify high-quality sentence-level extractions. In this paper, we propose a novel open IE system, called ReMine, which integrates local context signal and global structural signal in a unified framework with distant supervision. The new system can be efficiently applied to different domains as it uses facts from external knowledge bases as supervision; and can effectively score sentence-level tuple extractions based on corpus-level statistics. Specifically, we design a joint optimization problem to unify (1) segmenting entity/relation phrases in individual sentences based on local context; and (2) measuring the quality of sentence-level extractions with a translating-based objective. Experiments on two real-world corpora from different domains demonstrate the effectiveness and robustness of ReMine when compared to other open IE systems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/26/2018

Open Information Extraction with Global Structure Constraints

Extracting entities and their relations from text is an important task f...
research
10/27/2016

CoType: Joint Extraction of Typed Entities and Relations with Knowledge Bases

Extracting entities and relations for types of interest from text is imp...
research
06/15/2020

Extracting N-ary Cross-sentence Relations using Constrained Subsequence Kernel

Most of the past work in relation extraction deals with relations occurr...
research
08/24/2018

Reinforcement Learning for Relation Classification from Noisy Data

Existing relation classification methods that rely on distant supervisio...
research
04/29/2019

Logician: A Unified End-to-End Neural Approach for Open-Domain Information Extraction

In this paper, we consider the problem of open information extraction (O...
research
09/15/2021

AnnIE: An Annotation Platform for Constructing Complete Open Information Extraction Benchmark

Open Information Extraction (OIE) is the task of extracting facts from s...
research
01/24/2021

A Novel Two-stage Framework for Extracting Opinionated Sentences from News Articles

This paper presents a novel two-stage framework to extract opinionated s...

Please sign up or login with your details

Forgot password? Click here to reset