Counterfactually Augmented Data and Unintended Bias: The Case of Sexism and Hate Speech Detection

05/09/2022
by   Indira Sen, et al.
0

Counterfactually Augmented Data (CAD) aims to improve out-of-domain generalizability, an indicator of model robustness. The improvement is credited with promoting core features of the construct over spurious artifacts that happen to correlate with it. Yet, over-relying on core features may lead to unintended model bias. Especially, construct-driven CAD – perturbations of core features – may induce models to ignore the context in which core features are used. Here, we test models for sexism and hate speech detection on challenging data: non-hateful and non-sexist usage of identity and gendered terms. In these hard cases, models trained on CAD, especially construct-driven CAD, show higher false-positive rates than models trained on the original, unperturbed data. Using a diverse set of CAD – construct-driven and construct-agnostic – reduces such unintended bias.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/14/2021

How Does Counterfactually Augmented Data Impact Models for Social Computing Constructs?

As NLP models are increasingly deployed in socially situated settings su...
research
02/18/2023

Improving the Out-Of-Distribution Generalization Capability of Language Models: Counterfactually-Augmented Data is not Enough

Counterfactually-Augmented Data (CAD) has the potential to improve langu...
research
07/01/2021

An Investigation of the (In)effectiveness of Counterfactually Augmented Data

While pretrained language models achieve excellent performance on natura...
research
07/31/2023

Iterated Resultants in CAD

Cylindrical Algebraic Decomposition (CAD) by projection and lifting requ...
research
11/29/2022

AutoCAD: Automatically Generating Counterfactuals for Mitigating Shortcut Learning

Recent studies have shown the impressive efficacy of counterfactually au...
research
02/11/2023

Lazard-style CAD and Equational Constraints

McCallum-style Cylindrical Algebra Decomposition (CAD) is a major improv...
research
01/31/2019

Advances in the Treatment of Trimmed CAD Models due to Isogeometric Analysis

Trimming is a core technique in geometric modeling. Unfortunately, the r...

Please sign up or login with your details

Forgot password? Click here to reset