Mitigating Political Bias in Language Models Through Reinforced Calibration

04/30/2021
by Ruibo Liu, et al.

Current large-scale language models can be politically biased as a result of the data they are trained on, potentially causing serious problems when they are deployed in real-world settings. In this paper, we describe metrics for measuring political bias in GPT-2 generation and propose a reinforcement learning (RL) framework for mitigating political biases in generated text. By using rewards from word embeddings or a classifier, our RL framework guides debiased generation without having access to the training data or requiring the model to be retrained. In empirical experiments on three attributes sensitive to political bias (gender, location, and topic), our methods reduced bias according to both our metrics and human evaluation, while maintaining readability and semantic coherence.


research
06/24/2021

Towards Understanding and Mitigating Social Biases in Language Models

As machine learning methods are deployed in real-world settings such as ...
research
03/22/2022

A Prompt Array Keeps the Bias Away: Debiasing Vision-Language Models with Adversarial Learning

Vision-language models can encode societal biases and stereotypes, but t...
research
11/29/2020

Inflating Topic Relevance with Ideology: A Case Study of Political Ideology Bias in Social Topic Detection Models

We investigate the impact of political ideology biases in training data....
research
06/07/2021

RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of Conversational Language Models

Text representation models are prone to exhibit a range of societal bias...
research
06/11/2021

Assessing Political Prudence of Open-domain Chatbots

Politically sensitive topics are still a challenge for open-domain chatb...
research
06/22/2023

Apolitical Intelligence? Auditing Delphi's responses on controversial political issues in the US

As generative language models are deployed in ever-wider contexts, conce...
research
04/14/2023

The Self-Perception and Political Biases of ChatGPT

This contribution analyzes the self-perception and political biases of O...
