Mitigating Bias in Conversations: A Hate Speech Classifier and Debiaser with Prompts

07/14/2023
by   Shaina Raza, et al.
0

Discriminatory language and biases are often present in hate speech during conversations, which usually lead to negative impacts on targeted groups such as those based on race, gender, and religion. To tackle this issue, we propose an approach that involves a two-step process: first, detecting hate speech using a classifier, and then utilizing a debiasing component that generates less biased or unbiased alternatives through prompts. We evaluated our approach on a benchmark dataset and observed reduction in negativity due to hate speech comments. The proposed method contributes to the ongoing efforts to reduce biases in online discourse and promote a more inclusive and fair environment for communication.

READ FULL TEXT
research
05/13/2022

Analyzing Hate Speech Data along Racial, Gender and Intersectional Axes

To tackle the rising phenomenon of hate speech, efforts have been made t...
research
09/07/2021

Hi, my name is Martha: Using names to measure and mitigate bias in generative dialogue models

All AI models are susceptible to learning biases in data that they are t...
research
12/22/2021

Quantifying Gender Biases Towards Politicians on Reddit

Despite attempts to increase gender parity in politics, global efforts h...
research
09/16/2020

Impact and dynamics of hate and counter speech online

Citizen-generated counter speech is a promising way to fight hate speech...
research
10/28/2021

Hate Speech Classifiers Learn Human-Like Social Stereotypes

Social stereotypes negatively impact individuals' judgements about diffe...
research
09/11/2023

Detecting Natural Language Biases with Prompt-based Learning

In this project, we want to explore the newly emerging field of prompt e...
research
01/23/2018

The Enemy Among Us: Detecting Hate Speech with Threats Based 'Othering' Language Embeddings

Offensive or antagonistic language targeted at individuals and social gr...

Please sign up or login with your details

Forgot password? Click here to reset