Mitigating Temporal-Drift: A Simple Approach to Keep NER Models Crisp

04/20/2021
by   Shuguang Chen, et al.
0

Performance of neural models for named entity recognition degrades over time, becoming stale. This degradation is due to temporal drift, the change in our target variables' statistical properties over time. This issue is especially problematic for social media data, where topics change rapidly. In order to mitigate the problem, data annotation and retraining of models is common. Despite its usefulness, this process is expensive and time-consuming, which motivates new research on efficient model updating. In this paper, we propose an intuitive approach to measure the potential trendiness of tweets and use this metric to select the most informative instances to use for training. We conduct experiments on three state-of-the-art models on the Temporal Twitter Dataset. Our approach shows larger increases in prediction accuracy with less training data than the alternatives, making it an attractive, practical solution.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/18/2022

Annotating the Tweebank Corpus on Named Entity Recognition and Building NLP Models for Social Media Analysis

Social media data such as Twitter messages ("tweets") pose a particular ...
research
10/07/2022

Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts

Recent progress in language model pre-training has led to important impr...
research
07/24/2017

CAp 2017 challenge: Twitter Named Entity Recognition

The paper describes the CAp 2017 challenge. The challenge concerns the p...
research
10/15/2018

Neural Adaptation Layers for Cross-domain Named Entity Recognition

Recent research efforts have shown that neural architectures can be effe...
research
05/22/2023

Better Sampling of Negatives for Distantly Supervised Named Entity Recognition

Distantly supervised named entity recognition (DS-NER) has been proposed...
research
01/19/2021

Single versus Multiple Annotation for Named Entity Recognition of Mutations

The focus of this paper is to address the knowledge acquisition bottlene...
research
11/10/2015

USFD: Twitter NER with Drift Compensation and Linked Data

This paper describes a pilot NER system for Twitter, comprising the USFD...

Please sign up or login with your details

Forgot password? Click here to reset