Perturbation Sensitivity Analysis to Detect Unintended Model Biases

10/09/2019
by   Vinodkumar Prabhakaran, et al.
0

Data-driven statistical Natural Language Processing (NLP) techniques leverage large amounts of language data to build models that can understand language. However, most language data reflect the public discourse at the time the data was produced, and hence NLP models are susceptible to learning incidental associations around named referents at a particular point in time, in addition to general linguistic meaning. An NLP system designed to model notions such as sentiment and toxicity should ideally produce scores that are independent of the identity of such entities mentioned in text and their social associations. For example, in a general purpose sentiment analysis system, a phrase such as I hate Katy Perry should be interpreted as having the same sentiment as I hate Taylor Swift. Based on this idea, we propose a generic evaluation framework, Perturbation Sensitivity Analysis, which detects unintended model biases related to named entities, and requires no new annotations or corpora. We demonstrate the utility of this analysis by employing it on two different NLP models — a sentiment model and a toxicity model — applied on online comments in English language from four different genres.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/02/2020

Social Biases in NLP Models as Barriers for Persons with Disabilities

Building equitable and inclusive NLP technologies demands consideration ...
research
04/23/2019

Empirical Evaluation of Leveraging Named Entities for Arabic Sentiment Analysis

Social media reflects the public attitudes towards specific events. Even...
research
11/25/2021

Identification of Bias Against People with Disabilities in Sentiment Analysis and Toxicity Detection Models

Sociodemographic biases are a common problem for natural language proces...
research
02/07/2023

Applying BERT and ChatGPT for Sentiment Analysis of Lyme Disease in Scientific Literature

This chapter presents a practical guide for conducting Sentiment Analysi...
research
07/18/2023

Automated Ableism: An Exploration of Explicit Disability Biases in Sentiment and Toxicity Analysis Models

We analyze sentiment analysis and toxicity detection models to detect th...
research
02/03/2021

BiasFinder: Metamorphic Test Generation to Uncover Bias for Sentiment Analysis Systems

Artificial Intelligence (AI) software systems, such as Sentiment Analysi...
research
01/20/2020

Text-based inference of moral sentiment change

We present a text-based framework for investigating moral sentiment chan...

Please sign up or login with your details

Forgot password? Click here to reset