On the Strength of Character Language Models for Multilingual Named Entity Recognition

09/13/2018
by   Xiaodong Yu, et al.
0

Character-level patterns have been widely used as features in English Named Entity Recognition (NER) systems. However, to date there has been no direct investigation of the inherent differences between name and non-name tokens in text, nor whether this property holds across multiple languages. This paper analyzes the capabilities of corpus-agnostic Character-level Language Models (CLMs) in the binary task of distinguishing name tokens from non-name tokens. We demonstrate that CLMs provide a simple and powerful model for capturing these differences, identifying named entity tokens in a diverse set of languages at close to the performance of full NER systems. Moreover, by adding very simple CLM-based features we can significantly improve the performance of an off-the-shelf NER system for multiple languages.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/16/2019

Named Entity Recognition for Nepali Language

Named Entity Recognition have been studied for different languages like ...
research
09/16/2021

MFE-NER: Multi-feature Fusion Embedding for Chinese Named Entity Recognition

Pre-trained language models lead Named Entity Recognition (NER) into a n...
research
03/23/2021

TMR: Evaluating NER Recall on Tough Mentions

We propose the Tough Mentions Recall (TMR) metrics to supplement traditi...
research
03/06/2020

Improving Neural Named Entity Recognition with Gazetteers

The goal of this work is to improve the performance of a neural named en...
research
11/06/2021

Focusing on Possible Named Entities in Active Named Entity Label Acquisition

Named entity recognition (NER) aims to identify mentions of named entiti...
research
04/05/2022

LAMNER: Code Comment Generation Using Character Language Model and Named Entity Recognition

Code comment generation is the task of generating a high-level natural l...
research
09/12/2015

Kannada named entity recognition and classification (nerc) based on multinomial naïve bayes (mnb) classifier

Named Entity Recognition and Classification (NERC) is a process of ident...

Please sign up or login with your details

Forgot password? Click here to reset