Investigating Gender Bias in News Summarization

09/14/2023
by   Julius Steen, et al.
0

Summarization is an important application of large language models (LLMs). Most previous evaluation of summarization models has focused on their performance in content selection, grammaticality and coherence. However, it is well known that LLMs reproduce and reinforce harmful social biases. This raises the question: Do these biases affect model outputs in a relatively constrained setting like summarization? To help answer this question, we first motivate and introduce a number of definitions for biased behaviours in summarization models, along with practical measures to quantify them. Since we find biases inherent to the input document can confound our analysis, we additionally propose a method to generate input documents with carefully controlled demographic attributes. This allows us to sidestep this issue, while still working with somewhat realistic input documents. Finally, we apply our measures to summaries generated by both purpose-built summarization models and general purpose chat models. We find that content selection in single document summarization seems to be largely unaffected by bias, while hallucinations exhibit evidence of biases propagating to generated summaries.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/15/2021

Question-Based Salient Span Selection for More Controllable Text Summarization

In this work, we propose a method for incorporating question-answering (...
research
09/22/2021

MiRANews: Dataset and Benchmarks for Multi-Resource-Assisted News Summarization

One of the most challenging aspects of current single-document news summ...
research
05/03/2023

Characterizing Political Bias in Automatic Summaries: A Case Study of Trump and Biden

Growing literature has shown that powerful NLP systems may encode social...
research
10/04/2022

Towards Improving Faithfulness in Abstractive Summarization

Despite the success achieved in neural abstractive summarization based o...
research
05/31/2022

NEWTS: A Corpus for News Topic-Focused Summarization

Text summarization models are approaching human levels of fidelity. Exis...
research
04/27/2020

Screenplay Summarization Using Latent Narrative Structure

Most general-purpose extractive summarization models are trained on news...
research
06/09/2016

PerSum: Novel Systems for Document Summarization in Persian

In this paper we explore the problem of document summarization in Persia...

Please sign up or login with your details

Forgot password? Click here to reset