An Embarrassingly Easy but Strong Baseline for Nested Named Entity Recognition

08/09/2022
by   Hang Yan, et al.
3

Named entity recognition (NER) is the task to detect and classify the entity spans in the text. When entity spans overlap between each other, this problem is named as nested NER. Span-based methods have been widely used to tackle the nested NER. Most of these methods will get a score n × n matrix, where n means the length of sentence, and each entry corresponds to a span. However, previous work ignores spatial relations in the score matrix. In this paper, we propose using Convolutional Neural Network (CNN) to model these spatial relations in the score matrix. Despite being simple, experiments in three commonly used nested NER datasets show that our model surpasses several recently proposed methods with the same pre-trained encoders. Further analysis shows that using CNN can help the model find nested entities more accurately. Besides, we found that different papers used different sentence tokenizations for the three nested NER datasets, which will influence the comparison. Thus, we release a pre-processing script to facilitate future comparison.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/10/2022

A dynamic programming algorithm for span-based nested named-entity recognition in O(n^2)

Span-based nested named-entity recognition (NER) has a cubic-time comple...
research
10/03/2018

A Neural Transition-based Model for Nested Mention Recognition

It is common that entity mentions can contain other mentions recursively...
research
04/12/2022

Trigger-GNN: A Trigger-Based Graph Neural Network for Nested Named Entity Recognition

Nested named entity recognition (NER) aims to identify the entity bounda...
research
04/17/2022

Nested Named Entity Recognition as Holistic Structure Parsing

As a fundamental natural language processing task and one of core knowle...
research
05/18/2022

A reproducible experimental survey on biomedical sentence similarity: a string-based method sets the state of the art

This registered report introduces the largest, and for the first time, r...
research
06/09/2020

Extensive Error Analysis and a Learning-Based Evaluation of Medical Entity Recognition Systems to Approximate User Experience

When comparing entities extracted by a medical entity recognition system...
research
06/11/2021

EPICURE Ensemble Pretrained Models for Extracting Cancer Mutations from Literature

To interpret the genetic profile present in a patient sample, it is nece...

Please sign up or login with your details

Forgot password? Click here to reset