A Data-driven Approach for Noise Reduction in Distantly Supervised Biomedical Relation Extraction

05/26/2020
by Saadullah Amin, et al.

Fact triples are a common form of structured knowledge in the biomedical domain. As the volume of unstructured scientific text continues to grow, manually annotating it for relation extraction becomes increasingly expensive. Distant supervision addresses this cost by quickly producing large amounts of labeled, but considerably noisy, data. We aim to reduce this noise by extending an entity-enriched relation classification BERT model to multiple instance learning and by defining a simple data encoding scheme that significantly reduces noise, reaching state-of-the-art performance for distantly supervised biomedical relation extraction. Our approach further encodes knowledge about the direction of relation triples. This reduces noise and alleviates the need for joint learning with knowledge graph completion, allowing increased focus on relation learning.
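To make the setting concrete, here is a minimal sketch of how distant supervision can group sentences into bags for multiple instance learning while marking which entity is the head and which is the tail of a triple. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not specify the encoding, so the marker symbols (`$` for the head, `^` for the tail), the helper names `mark_entities` and `build_bags`, and the toy triple and sentences are all hypothetical.

```python
from collections import defaultdict


def mark_entities(sentence, head, tail):
    """Wrap the head entity in '$' and the tail entity in '^'.

    The marker symbols are an assumption made for this sketch. Only the
    first mention of each entity is marked to keep the example simple.
    """
    marked = sentence.replace(head, f"$ {head} $", 1)
    marked = marked.replace(tail, f"^ {tail} ^", 1)
    return marked


def build_bags(triples, sentences):
    """Group marked sentences into bags keyed by the ordered (head, tail) pair.

    Every sentence that mentions both entities inherits the triple's relation
    as its label, whether or not it actually expresses that relation. This is
    the source of label noise in distant supervision.
    """
    bags = defaultdict(list)
    labels = {}
    for head, relation, tail in triples:
        labels[(head, tail)] = relation
        for sentence in sentences:
            if head in sentence and tail in sentence:
                bags[(head, tail)].append(mark_entities(sentence, head, tail))
    return dict(bags), labels


# Toy data (hypothetical): one knowledge-base triple and three sentences.
triples = [("aspirin", "may_treat", "headache")]
sentences = [
    "aspirin is often used to relieve headache.",
    "headache was reported after aspirin withdrawal.",  # noisy match
    "ibuprofen reduces fever.",
]

bags, labels = build_bags(triples, sentences)
for pair, bag in bags.items():
    print(pair, labels[pair])
    for marked in bag:
        print("  ", marked)
```

In this sketch the markers identify the head and tail regardless of the order in which they appear in the text, so the second sentence still records that `aspirin` is the head even though `headache` comes first. A multiple instance learning model would then predict one relation per bag rather than per sentence, which limits the influence of noisy matches such as that second sentence.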


