Towards Robust NLG Bias Evaluation with Syntactically-diverse Prompts

12/03/2022
by   Arshiya Aggarwal, et al.
0

We present a robust methodology for evaluating biases in natural language generation(NLG) systems. Previous works use fixed hand-crafted prefix templates with mentions of various demographic groups to prompt models to generate continuations for bias analysis. These fixed prefix templates could themselves be specific in terms of styles or linguistic structures, which may lead to unreliable fairness conclusions that are not representative of the general trends from tone varying prompts. To study this problem, we paraphrase the prompts with different syntactic structures and use these to evaluate demographic bias in NLG systems. Our results suggest similar overall bias trends but some syntactic structures lead to contradictory conclusions compared to past works. We show that our methodology is more robust and that some syntactic structures prompt more toxic content while others could prompt less biased generation. This suggests the importance of not relying on a fixed syntactic structure and using tone-invariant prompts. Introducing syntactically-diverse prompts can achieve more robust NLG (bias) evaluation.

READ FULL TEXT
research
10/09/2022

Quantifying Social Biases Using Templates is Unreliable

Recently, there has been an increase in efforts to understand how large ...
research
02/03/2021

BiasFinder: Metamorphic Test Generation to Uncover Bias for Sentiment Analysis Systems

Artificial Intelligence (AI) software systems, such as Sentiment Analysi...
research
09/16/2021

Balancing out Bias: Achieving Fairness Through Training Reweighting

Bias in natural language processing arises primarily from models learnin...
research
05/18/2022

"I'm sorry to hear that": finding bias in language models with a holistic descriptor dataset

As language models grow in popularity, their biases across all possible ...
research
03/06/2020

Demographic Bias in Presentation Attack Detection of Iris Recognition Systems

With the widespread use of biometric systems, the demographic bias probl...
research
08/13/2022

A Study of Demographic Bias in CNN-based Brain MR Segmentation

Convolutional neural networks (CNNs) are increasingly being used to auto...
research
02/11/2018

Syntax and Semantics of Italian Poetry in the First Half of the 20th Century

In this paper we study, analyse and comment rhetorical figures present i...

Please sign up or login with your details

Forgot password? Click here to reset