Characterizing and Measuring Linguistic Dataset Drift

05/26/2023
by Tyler A. Chang, et al.
NLP models often degrade in performance when real-world data distributions differ markedly from training data. However, existing dataset drift metrics in NLP have generally not considered the specific dimensions of linguistic drift that affect model performance, and they have not been validated in their ability to predict model performance at the individual example level, where such metrics are often used in practice. In this paper, we propose three dimensions of linguistic dataset drift: vocabulary, structural, and semantic drift. These dimensions correspond to content word frequency divergences, syntactic divergences, and meaning changes not captured by word frequencies (e.g. lexical semantic change). We propose interpretable metrics for all three drift dimensions, and we modify past performance prediction methods to predict model performance at both the example and dataset level for English sentiment classification and natural language inference. We find that our drift metrics are more effective than previous metrics at predicting out-of-domain model accuracies (mean 16.8% root mean square error decrease), particularly when compared to popular fine-tuned embedding distances (mean 47.7% error decrease). Fine-tuned embedding distances are much more effective at ranking individual examples by expected performance, but decomposing into vocabulary, structural, and semantic drift produces the best example rankings of all considered model-agnostic drift metrics (mean 6.7% ROC AUC increase).
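
To make the idea of vocabulary drift as a word frequency divergence concrete, here is a minimal sketch in Python. It is an illustration only and not the paper's metric: it assumes whitespace tokenization, counts all words rather than only content words, and uses Jensen-Shannon divergence as a stand-in divergence. The function names `word_distribution` and `js_divergence` are hypothetical.

```python
import math
from collections import Counter


def word_distribution(texts):
    """Relative frequency of each lowercased, whitespace-split token.

    Assumption: a real content-word metric would filter by part of speech;
    this sketch keeps every token for simplicity.
    """
    counts = Counter(w for t in texts for w in t.lower().split())
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()}


def js_divergence(p, q):
    """Jensen-Shannon divergence (base 2) between two word distributions.

    Returns 0.0 for identical distributions and 1.0 for distributions
    with no shared words.
    """
    div = 0.0
    for w in set(p) | set(q):
        pw, qw = p.get(w, 0.0), q.get(w, 0.0)
        mw = 0.5 * (pw + qw)  # mixture probability of word w
        if pw > 0:
            div += 0.5 * pw * math.log2(pw / mw)
        if qw > 0:
            div += 0.5 * qw * math.log2(qw / mw)
    return div


# Toy example: training-domain text versus evaluation-domain text.
train = ["the movie was great", "the plot was dull"]
test = ["the battery was great", "the screen was dull"]
drift = js_divergence(word_distribution(train), word_distribution(test))
```

A score near 0 suggests the two datasets use words at similar rates, while a score near 1 suggests nearly disjoint vocabularies. Structural and semantic drift would need different inputs (for example syntactic tags or contextual embeddings) and are not covered by this sketch.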

