AUC Maximization for Low-Resource Named Entity Recognition

12/09/2022
by   Ngoc Dang Nguyen, et al.
0

Current work in named entity recognition (NER) uses either cross entropy (CE) or conditional random fields (CRF) as the objective/loss functions to optimize the underlying NER model. Both of these traditional objective functions for the NER problem generally produce adequate performance when the data distribution is balanced and there are sufficient annotated training examples. But since NER is inherently an imbalanced tagging problem, the model performance under the low-resource settings could suffer using these standard objective functions. Based on recent advances in area under the ROC curve (AUC) maximization, we propose to optimize the NER model by maximizing the AUC score. We give evidence that by simply combining two binary-classifiers that maximize the AUC score, significant performance improvement over traditional loss functions is achieved under low-resource NER settings. We also conduct extensive experiments to demonstrate the advantages of our method under the low-resource and highly-imbalanced data distribution settings. To the best of our knowledge, this is the first work that brings AUC maximization to the NER setting. Furthermore, we show that our method is agnostic to different types of NER embeddings, models and domains. The code to replicate this work will be provided upon request.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/07/2022

AsNER – Annotated Dataset and Baseline for Assamese Named Entity recognition

We present the AsNER, a named entity annotation dataset for low resource...
research
04/15/2023

Robust Educational Dialogue Act Classifiers with Low-Resource and Imbalanced Datasets

Dialogue acts (DAs) can represent conversational actions of tutors or st...
research
09/04/2021

Data Augmentation for Cross-Domain Named Entity Recognition

Current work in named entity recognition (NER) shows that data augmentat...
research
03/06/2020

Improving Neural Named Entity Recognition with Gazetteers

The goal of this work is to improve the performance of a neural named en...
research
01/02/2021

A Robust and Domain-Adaptive Approach for Low-Resource Named Entity Recognition

Recently, it has attracted much attention to build reliable named entity...
research
11/03/2020

DAGA: Data Augmentation with a Generation Approach for Low-resource Tagging Tasks

Data augmentation techniques have been widely used to improve machine le...
research
10/16/2021

Good Examples Make A Faster Learner: Simple Demonstration-based Learning for Low-resource NER

Recent advances in prompt-based learning have shown impressive results o...

Please sign up or login with your details

Forgot password? Click here to reset