Improving Open Information Extraction via Iterative Rank-Aware Learning

05/31/2019
by   Zhengbao Jiang, et al.
0

Open information extraction (IE) is the task of extracting open-domain assertions from natural language sentences. A key step in open IE is confidence modeling, ranking the extractions based on their estimated quality to adjust precision and recall of extracted assertions. We found that the extraction likelihood, a confidence measure used by current supervised open IE systems, is not well calibrated when comparing the quality of assertions extracted from different sentences. We propose an additional binary classification loss to calibrate the likelihood to make it more globally comparable, and an iterative learning process, where extractions generated by the open IE model are incrementally included as training samples to help the model learn from trial and error. Experiments on OIE2016 demonstrate the effectiveness of our method. Code and data are available at https://github.com/jzbjyb/oie_rank.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/06/2020

SelfORE: Self-supervised Relational Feature Learning for Open Relation Extraction

Open relation extraction is the task of extracting open-domain relation ...
research
01/30/2019

Span Based Open Information Extraction

In this paper, we propose a span based model combined with syntactic inf...
research
07/10/2023

Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confidence Estimation

Large pre-trained language models achieve impressive results across many...
research
05/16/2023

Easy-to-Hard Learning for Information Extraction

Information extraction (IE) systems aim to automatically extract structu...
research
04/28/2020

Joint Keyphrase Chunking and Salience Ranking with BERT

An effective keyphrase extraction system requires to produce self-contai...
research
03/11/2019

Un duel probabiliste pour départager deux présidents (LIA @ DEFT'2005)

We present a set of probabilistic models applied to binary classificatio...
research
05/01/2020

An Information Bottleneck Approach for Controlling Conciseness in Rationale Extraction

Decisions of complex language understanding models can be rationalized b...

Please sign up or login with your details

Forgot password? Click here to reset