Identifying Spurious Correlations for Robust Text Classification

10/06/2020
by   Zhao Wang, et al.
0

The predictions of text classifiers are often driven by spurious correlations – e.g., the term `Spielberg' correlates with positively reviewed movies, even though the term itself does not semantically convey a positive sentiment. In this paper, we propose a method to distinguish spurious and genuine correlations in text classification. We treat this as a supervised classification problem, using features derived from treatment effect estimators to distinguish spurious correlations from "genuine" ones. Due to the generic nature of these features and their small dimensionality, we find that the approach works well even with limited training examples, and that it is possible to transport the word classifier to new domains. Experiments on four datasets (sentiment classification and toxicity detection) suggest that using this approach to inform feature selection also leads to more robust classification, as measured by improved worst-case accuracy on the samples affected by spurious correlations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/22/2021

Does a Hybrid Neural Network based Feature Selection Model Improve Text Classification?

Text classification is a fundamental problem in the field of natural lan...
research
06/17/2018

An Improved Text Sentiment Classification Model Using TF-IDF and Next Word Negation

With the rapid growth of Text sentiment analysis, the demand for automat...
research
02/13/2023

Identifying Semantically Difficult Samples to Improve Text Classification

In this paper, we investigate the effect of addressing difficult samples...
research
05/26/2021

Predict then Interpolate: A Simple Algorithm to Learn Stable Classifiers

We propose Predict then Interpolate (PI), a simple algorithm for learnin...
research
10/17/2017

Multi-Task Label Embedding for Text Classification

Multi-task learning in text classification leverages implicit correlatio...
research
06/22/2023

Identifying and Disentangling Spurious Features in Pretrained Image Representations

Neural networks employ spurious correlations in their predictions, resul...
research
06/10/2019

A cost-reducing partial labeling estimator in text classification problem

We propose a new approach to address the text classification problems wh...

Please sign up or login with your details

Forgot password? Click here to reset