Finding Influential Instances for Distantly Supervised Relation Extraction

by   Zifeng Wang, et al.

Distant supervision has been demonstrated to be highly beneficial to enhance relation extraction models, but it often suffers from high label noise. In this work, we propose a novel model-agnostic instance subsampling method for distantly supervised relation extraction, namely REIF, which bridges the gap of realizing influence subsampling in deep learning. It encompasses two key steps: first calculating instance-level influences that measure how much each training instance contributes to the validation loss change of our model, then deriving sampling probabilities via the proposed sigmoid sampling function to perform batch-in-bag sampling. We design a fast influence subsampling scheme that reduces the computational complexity from O(mn) to O(1), and analyze its robustness when the sigmoid sampling function is employed. Empirical experiments demonstrate our method's superiority over the baselines, and its ability to support interpretable instance selection.



page 1

page 2

page 3

page 4


Towards Time-Aware Distant Supervision for Relation Extraction

Distant supervision for relation extraction heavily suffers from the wro...

Structured Minimally Supervised Learning for Neural Relation Extraction

We present an approach to minimally supervised relation extraction that ...

A Sample-Based Training Method for Distantly Supervised Relation Extraction with Pre-Trained Transformers

Multiple instance learning (MIL) has become the standard learning paradi...

Empower Distantly Supervised Relation Extraction with Collaborative Adversarial Training

With recent advances in distantly supervised (DS) relation extraction (R...

A Simple, Strong and Robust Baseline for Distantly Supervised Relation Extraction

Distantly supervised relation extraction (DS-RE) is generally framed as ...

CIL: Contrastive Instance Learning Framework for Distantly Supervised Relation Extraction

The journey of reducing noise from distant supervision (DS) generated tr...

Posterior-regularized REINFORCE for Instance Selection in Distant Supervision

This paper provides a new way to improve the efficiency of the REINFORCE...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.