TEAM-Atreides at SemEval-2022 Task 11: On leveraging data augmentation and ensemble to recognize complex Named Entities in Bangla

04/21/2022
by   Nazia Tasnim, et al.
0

Many areas, such as the biological and healthcare domain, artistic works, and organization names, have nested, overlapping, discontinuous entity mentions that may even be syntactically or semantically ambiguous in practice. Traditional sequence tagging algorithms are unable to recognize these complex mentions because they may violate the assumptions upon which sequence tagging schemes are founded. In this paper, we describe our contribution to SemEval 2022 Task 11 on identifying such complex Named Entities. We have leveraged the ensemble of multiple ELECTRA-based models that were exclusively pretrained on the Bangla language with the performance of ELECTRA-based models pretrained on English to achieve competitive performance on the Track-11. Besides providing a system description, we will also present the outcomes of our experiments on architectural decisions, dataset augmentations, and post-competition findings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/20/2022

Mulco: Recognizing Chinese Nested Named Entities Through Multiple Scopes

Nested Named Entity Recognition (NNER) has been a long-term challenge to...
research
06/02/2021

A Unified Generative Framework for Various NER Subtasks

Named Entity Recognition (NER) is the task of identifying spans that rep...
research
06/28/2021

A Span-Based Model for Joint Overlapped and Discontinuous Named Entity Recognition

Research on overlapped and discontinuous named entity recognition (NER) ...
research
10/21/2022

NEREL-BIO: A Dataset of Biomedical Abstracts Annotated with Nested Named Entities

This paper describes NEREL-BIO – an annotation scheme and corpus of PubM...
research
06/23/2021

Recognising Biomedical Names: Challenges and Solutions

The growth rate in the amount of biomedical documents is staggering. Unl...
research
06/11/2021

EPICURE Ensemble Pretrained Models for Extracting Cancer Mutations from Literature

To interpret the genetic profile present in a patient sample, it is nece...

Please sign up or login with your details

Forgot password? Click here to reset