Bootstrapping Relation Extractors using Syntactic Search by Examples

02/09/2021
by   Matan Eyal, et al.
0

The advent of neural-networks in NLP brought with it substantial improvements in supervised relation extraction. However, obtaining a sufficient quantity of training data remains a key challenge. In this work we propose a process for bootstrapping training datasets which can be performed quickly by non-NLP-experts. We take advantage of search engines over syntactic-graphs (Such as Shlain et al. (2020)) which expose a friendly by-example syntax. We use these to obtain positive examples by searching for sentences that are syntactically similar to user input examples. We apply this technique to relations from TACRED and DocRED and show that the resulting models are competitive with models trained on manually annotated data and on data obtained from distant supervision. The models also outperform models trained using NLG data augmentation techniques. Extending the search-based approach with the NLG method further improves the results.

READ FULL TEXT
research
09/12/2015

Improving distant supervision using inference learning

Distant supervision is a widely applied approach to automatic training o...
research
05/18/2023

Silver Syntax Pre-training for Cross-Domain Relation Extraction

Relation Extraction (RE) remains a challenging task, especially when con...
research
05/26/2023

GDA: Generative Data Augmentation Techniques for Relation Extraction Tasks

Relation extraction (RE) tasks show promising performance in extracting ...
research
06/16/2023

Class-Adaptive Self-Training for Relation Extraction with Incompletely Annotated Training Data

Relation extraction (RE) aims to extract relations from sentences and do...
research
05/11/2017

Learning with Noise: Enhance Distantly Supervised Relation Extraction with Dynamic Transition Matrix

Distant supervision significantly reduces human efforts in building trai...
research
01/26/2021

Generating Syntactically Controlled Paraphrases without Using Annotated Parallel Pairs

Paraphrase generation plays an essential role in natural language proces...
research
09/19/2021

Training Dynamic based data filtering may not work for NLP datasets

The recent increase in dataset size has brought about significant advanc...

Please sign up or login with your details

Forgot password? Click here to reset