Robustness to Capitalization Errors in Named Entity Recognition

11/13/2019
by   Sravan Bodapati, et al.
7

Robustness to capitalization errors is a highly desirable characteristic of named entity recognizers, yet we find standard models for the task are surprisingly brittle to such noise. Existing methods to improve robustness to the noise completely discard given orthographic information, mwhich significantly degrades their performance on well-formed text. We propose a simple alternative approach based on data augmentation, which allows the model to learn to utilize or ignore orthographic information depending on its usefulness in the context. It achieves competitive robustness to capitalization errors while making negligible compromise to its performance on well-formed text and significantly improving generalization power on noisy user-generated text. Our experiments clearly and consistently validate our claim across different types of machine learning models, languages, and dataset sizes.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/19/2022

EnTDA: Entity-to-Text based Data Augmentation Approach for Named Entity Recognition Tasks

Data augmentation techniques have been used to improve the generalizatio...
research
08/15/2023

Informed Named Entity Recognition Decoding for Generative Language Models

Ever-larger language models with ever-increasing capabilities are by now...
research
06/14/2023

Research on Named Entity Recognition in Improved transformer with R-Drop structure

To enhance the generalization ability of the model and improve the effec...
research
05/14/2020

NAT: Noise-Aware Training for Robust Neural Sequence Labeling

Sequence labeling systems should perform reliably not only under ideal c...
research
06/28/2017

Named Entity Disambiguation for Noisy Text

We address the task of Named Entity Disambiguation (NED) for noisy text....
research
10/14/2021

Understanding Model Robustness to User-generated Noisy Texts

Sensitivity of deep-neural models to input noise is known to be a challe...

Please sign up or login with your details

Forgot password? Click here to reset