Sources of Transfer in Multilingual Named Entity Recognition

05/02/2020
by David Mueller, et al.

Named entities are inherently multilingual, and annotations in any given language may be limited. This motivates us to consider polyglot named-entity recognition (NER), where one model is trained using annotated data drawn from more than one language. However, a straightforward implementation of this simple idea does not always work in practice: naive training of NER models on annotated data drawn from multiple languages consistently underperforms models trained on monolingual data alone, despite having access to more training data. The starting point of this paper is a simple solution to this problem, in which polyglot models are fine-tuned on monolingual data and then consistently and significantly outperform their monolingual counterparts. To explain this phenomenon, we explore the sources of multilingual transfer in polyglot NER models and examine the weight structure of polyglot models compared to their monolingual counterparts. We find that polyglot models efficiently share many parameters across languages and that fine-tuning may utilize a large number of those parameters.
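
The two-stage recipe described in the abstract (train one model on data pooled from several languages, then continue training on the target language alone) can be sketched on a toy scale. This is only an illustrative sketch under stated assumptions: the abstract does not specify the model architecture, so a simple perceptron token classifier stands in for it, and the helper names (`features`, `predict`, `train`, `polyglot_then_finetune`), the feature set, and the two-sentence dataset are all hypothetical.

```python
from collections import defaultdict


def features(tokens, i):
    """Hypothetical hand-written features for token i (a stand-in for a learned encoder)."""
    tok = tokens[i]
    return [f"word={tok.lower()}", f"cap={tok[:1].isupper()}", f"suffix={tok[-2:].lower()}"]


def predict(weights, labels, tokens, i):
    """Return the highest-scoring label; ties go to the earliest label in `labels`."""
    feats = features(tokens, i)
    return max(labels, key=lambda y: sum(weights[(f, y)] for f in feats))


def train(weights, labels, sentences, epochs):
    """Perceptron updates in place, so a second call continues from the current weights."""
    for _ in range(epochs):
        for tokens, tags in sentences:
            for i, gold in enumerate(tags):
                guess = predict(weights, labels, tokens, i)
                if guess != gold:
                    for f in features(tokens, i):
                        weights[(f, gold)] += 1.0
                        weights[(f, guess)] -= 1.0
    return weights


def polyglot_then_finetune(data_by_lang, target_lang, labels, epochs=5):
    # Stage 1: polyglot training on the pooled data of every language.
    pooled = [s for lang in sorted(data_by_lang) for s in data_by_lang[lang]]
    weights = train(defaultdict(float), labels, pooled, epochs)
    # Stage 2: fine-tune the same weights on the target language only.
    return train(weights, labels, data_by_lang[target_lang], epochs)


# Toy data, invented purely for illustration.
labels = ["O", "PER"]
data = {
    "en": [(["Alice", "sleeps"], ["PER", "O"])],
    "es": [(["Alicia", "duerme"], ["PER", "O"])],
}
model = polyglot_then_finetune(data, "es", labels)
print(predict(model, labels, ["Alicia", "duerme"], 0))
```

In this toy setting the shared `cap=True` feature learned from the English sentence already helps tag the Spanish name, which loosely mirrors the parameter sharing the abstract reports; it says nothing about the magnitude of the gains in the paper itself.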

research
05/05/2023

LLM-RM at SemEval-2023 Task 2: Multilingual Complex NER using XLM-RoBERTa

Named Entity Recognition (NER) is a task of recognizing entities at a tok...
research
03/24/2022

Mono vs Multilingual BERT: A Case Study in Hindi and Marathi Named Entity Recognition

Named entity recognition (NER) is the process of recognising and classif...
research
04/08/2021

COVID-19 Named Entity Recognition for Vietnamese

The current COVID-19 pandemic has led to the creation of many corpora t...
research
05/30/2023

A Multilingual Evaluation of NER Robustness to Adversarial Inputs

Adversarial evaluations of language models typically focus on English al...
research
06/10/2023

Enhancing Low Resource NER Using Assisting Language And Transfer Learning

Named Entity Recognition (NER) is a fundamental task in NLP that is used...
research
02/21/2019

Pretrained language model transfer on neural named entity recognition in Indonesian conversational texts

Named entity recognition (NER) is an important task in NLP, which is all...
research
05/01/2020

Partially-Typed NER Datasets Integration: Connecting Practice to Theory

While typical named entity recognition (NER) models require the training...
