Mulco: Recognizing Chinese Nested Named Entities Through Multiple Scopes

11/20/2022
by   Jiuding Yang, et al.

Nested Named Entity Recognition (NNER), an important sub-area of Named Entity Recognition, has been a long-standing challenge for researchers. In NNER, one entity may be part of a longer entity, and this nesting may occur at multiple levels. Because of these nested structures, traditional sequence labeling methods cannot properly recognize all entities. While recent research focuses on designing better recognition methods for NNER in a variety of languages, Chinese NNER (CNNER) still lacks attention, and no freely accessible, CNNER-specific benchmark exists. In this paper, we aim to address CNNER by providing a Chinese dataset and a learning-based model. To facilitate research on this task, we release ChiNesE, a CNNER dataset with 20,000 sentences sampled from online passages of multiple domains, containing 117,284 entities falling into 10 categories, where 43.8 percent of those entities are nested. Based on ChiNesE, we propose Mulco, a novel method that recognizes named entities in nested structures through multiple scopes. Each scope uses a scope-based sequence labeling method, which recognizes a named entity by predicting its anchor and its length. Experimental results show that Mulco outperforms several baseline methods with different recognition schemes on ChiNesE. We also conduct extensive experiments on the ACE2005 Chinese corpus, where Mulco achieves the best performance compared with the baseline methods.
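
To make the anchor-and-length idea concrete, the following is a minimal sketch of how per-character predictions from several scopes might be decoded into nested spans. It is not the authors' implementation: the abstract does not say where the anchor sits, so the sketch assumes it is the entity's first character, and the function names, the example sentence, and the category labels (`ORG`, `GPE`) are all hypothetical.

```python
from typing import List, Set, Tuple

Span = Tuple[int, int, str]             # (start, end_exclusive, category)
Scope = Tuple[List[str], List[int]]     # (anchor labels, predicted lengths)


def decode_scope(anchors: List[str], lengths: List[int]) -> List[Span]:
    """Decode one scope: anchors[i] is a category, or "O" for no entity.

    Assumption (not stated in the abstract): the anchor is the first
    character of the entity, so the span covers [i, i + lengths[i]).
    """
    spans = []
    for i, (label, length) in enumerate(zip(anchors, lengths)):
        # Skip non-anchors and predictions that run past the sentence end.
        if label != "O" and length > 0 and i + length <= len(anchors):
            spans.append((i, i + length, label))
    return spans


def decode_all(scopes: List[Scope]) -> List[Span]:
    """Union the spans found in every scope, dropping exact duplicates."""
    found: Set[Span] = set()
    for anchors, lengths in scopes:
        found.update(decode_scope(anchors, lengths))
    return sorted(found)


# Hypothetical example: "北京大学" (an organization) contains "北京" (a place).
sentence = "北京大学位于北京"
scopes: List[Scope] = [
    # Scope 1: the longer entity at index 0, plus the place at index 6.
    (["ORG", "O", "O", "O", "O", "O", "GPE", "O"], [4, 0, 0, 0, 0, 0, 2, 0]),
    # Scope 2: the shorter entity that shares the same anchor position 0.
    (["GPE", "O", "O", "O", "O", "O", "O", "O"], [2, 0, 0, 0, 0, 0, 0, 0]),
]
entities = decode_all(scopes)
texts = [(sentence[s:e], cat) for s, e, cat in entities]
```

In this toy case both "北京大学" and the inner "北京" are anchored at the same character, so a single label sequence could hold only one of them; spreading the predictions over more than one scope is what lets both spans be recovered.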

