BERT-Based Multi-Head Selection for Joint Entity-Relation Extraction

08/16/2019
by Weipeng Huang, et al.

In this paper, we report our method for the Information Extraction task in the 2019 Language and Intelligence Challenge. We incorporate BERT into the multi-head selection framework for joint entity-relation extraction. The model extends existing approaches in three ways. First, BERT is adopted as the feature extraction layer at the bottom of the multi-head selection framework, and we further optimize it by introducing a semantic-enhanced task during BERT pre-training. Second, we introduce a large-scale Baidu Baike corpus for entity recognition pre-training; this is a form of weakly supervised learning, since the corpus carries no actual named entity labels. Third, we propose soft label embedding to transmit information effectively between entity recognition and relation extraction. Combining these three contributions, we enhance the information extraction ability of the multi-head selection model and achieve an F1 score of 0.876 on testset-1 with a single model. By ensembling four variants of our model, we finally achieve an F1 score of 0.892 (1st place) on testset-1 and an F1 score of 0.8924 (2nd place) on testset-2.
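The abstract does not spell out how soft label embedding is computed. One plausible reading, offered here only as a minimal sketch, is that the entity tagger's full probability distribution, rather than its single argmax label, weights the rows of a label embedding table before the result is passed to the relation extraction layer. The function names, the tiny embedding table, and the logits below are all hypothetical and are not taken from the paper.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def hard_label_embedding(logits, label_table):
    """Baseline: look up the embedding row of the argmax label only."""
    best = max(range(len(logits)), key=lambda k: logits[k])
    return list(label_table[best])


def soft_label_embedding(logits, label_table):
    """Assumed soft variant: probability-weighted sum of all label rows."""
    probs = softmax(logits)
    dim = len(label_table[0])
    return [
        sum(p * row[d] for p, row in zip(probs, label_table))
        for d in range(dim)
    ]


# Hypothetical table: 3 entity labels, embedding dimension 2.
label_table = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
# Hypothetical tagger logits for one token: labels 0 and 1 are nearly tied.
logits = [2.0, 1.9, -3.0]

hard = hard_label_embedding(logits, label_table)
soft = soft_label_embedding(logits, label_table)
print("hard:", hard)
print("soft:", soft)
```

In this toy case the hard lookup commits entirely to label 0 even though label 1 is almost as likely, while the soft version keeps both candidates visible to whatever layer consumes the embedding. That is the kind of information transfer between the two subtasks that the abstract appears to describe.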

