An Overview of Distant Supervision for Relation Extraction with a Focus on Denoising and Pre-training Methods

07/17/2022
by   William Hogan, et al.
0

Relation Extraction (RE) is a foundational task of natural language processing. RE seeks to transform raw, unstructured text into structured knowledge by identifying relational information between entity pairs found in text. RE has numerous uses, such as knowledge graph completion, text summarization, question-answering, and search querying. The history of RE methods can be roughly organized into four phases: pattern-based RE, statistical-based RE, neural-based RE, and large language model-based RE. This survey begins with an overview of a few exemplary works in the earlier phases of RE, highlighting limitations and shortcomings to contextualize progress. Next, we review popular benchmarks and critically examine metrics used to assess RE performance. We then discuss distant supervision, a paradigm that has shaped the development of modern RE methods. Lastly, we review recent RE works focusing on denoising and pre-training methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/08/2020

Denoising Relation Extraction from Document-level Distant Supervision

Distant supervision (DS) has been widely used to generate auto-labeled d...
research
05/18/2022

Relation Extraction with Weighted Contrastive Pre-training on Distant Supervision

Contrastive pre-training on distant supervision has shown remarkable eff...
research
06/03/2023

A Comprehensive Survey on Deep Learning for Relation Extraction: Recent Advances and New Frontiers

Relation extraction (RE) involves identifying the relations between enti...
research
01/06/2021

Deep Neural Network Based Relation Extraction: An Overview

Knowledge is a formal way of understanding the world, providing a human-...
research
03/24/2018

Simple Large-scale Relation Extraction from Unstructured Text

Knowledge-based question answering relies on the availability of facts, ...
research
05/26/2020

A Data-driven Approach for Noise Reduction in Distantly Supervised Biomedical Relation Extraction

Fact triples are a common form of structured knowledge used within the b...
research
08/18/2020

An Annotated Corpus of Webtables for Information Extraction Tasks

Information Extraction is a well-researched area of Natural Language Pro...

Please sign up or login with your details

Forgot password? Click here to reset