Combining Spans into Entities: A Neural Two-Stage Approach for Recognizing Discontiguous Entities

09/03/2019
by   Bailin Wang, et al.
0

In medical documents, it is possible that an entity of interest not only contains a discontiguous sequence of words but also overlaps with another entity. Entities of such structures are intrinsically hard to recognize due to the large space of possible entity combinations. In this work, we propose a neural two-stage approach to recognize discontiguous and overlapping entities by decomposing this problem into two subtasks: 1) it first detects all the overlapping spans that either form entities on their own or present as segments of discontiguous entities, based on the representation of segmental hypergraph, 2) next it learns to combine these segments into discontiguous entities with a classifier, which filters out other incorrect combinations of segments. Two neural components are designed for these subtasks respectively and they are learned jointly using a shared encoder for text. Our model achieves the state-of-the-art performance in a standard dataset, even in the absence of external features that previous methods used.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/20/2019

Multi-Grained Named Entity Recognition

This paper presents a novel framework, MGNER, for Multi-Grained Named En...
research
10/03/2018

Neural Segmental Hypergraphs for Overlapping Mention Recognition

In this work, we propose a novel segmental hypergraph representation to ...
research
10/19/2018

Learning to Recognize Discontiguous Entities

This paper focuses on the study of recognizing discontiguous entities. M...
research
02/14/2020

Understanding patient complaint characteristics using contextual clinical BERT embeddings

In clinical conversational applications, extracted entities tend to capt...
research
09/13/2021

MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations

Entity retrieval, which aims at disambiguating mentions to canonical ent...
research
10/22/2018

Labeling Gaps Between Words: Recognizing Overlapping Mentions with Mention Separators

In this paper, we propose a new model that is capable of recognizing ove...
research
09/22/2014

Temporally Coherent Bayesian Models for Entity Discovery in Videos by Tracklet Clustering

A video can be represented as a sequence of tracklets, each spanning 10-...

Please sign up or login with your details

Forgot password? Click here to reset