Reducing Target Group Bias in Hate Speech Detectors

12/07/2021
by   Darsh J Shah, et al.
0

The ubiquity of offensive and hateful content on online fora necessitates the need for automatic solutions that detect such content competently across target groups. In this paper we show that text classification models trained on large publicly available datasets despite having a high overall performance, may significantly under-perform on several protected groups. On the <cit.> dataset, we find the accuracy to be 37% lower on an under annotated Black Women target group and 12% lower on Immigrants, where hate speech involves a distinct style. To address this, we propose to perform token-level hate sense disambiguation, and utilize tokens' hate sense representations for detection, modeling more general signals. On two publicly available datasets, we observe that the variance in model accuracy across target groups drops by at least 30%, improving the average target group performance by 4% and worst case performance by 13%.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/07/2022

How does overparametrization affect performance on minority groups?

The benefits of overparameterization for the overall performance of mode...
research
12/18/2020

HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection

Hate speech is a challenging issue plaguing the online social media. Whi...
research
04/21/2023

A Group-Specific Approach to NLP for Hate Speech Detection

Automatic hate speech detection is an important yet complex task, requir...
research
02/07/2021

"Short is the Road that Leads from Fear to Hate": Fear Speech in Indian WhatsApp Groups

WhatsApp is the most popular messaging app in the world. Due to its popu...
research
11/30/2018

Detecting Offensive Content in Open-domain Conversations using Two Stage Semi-supervision

As open-ended human-chatbot interaction becomes commonplace, sensitive c...
research
05/11/2022

Linear average-case complexity of algorithmic problems in groups

The worst-case complexity of group-theoretic algorithms has been studied...

Please sign up or login with your details

Forgot password? Click here to reset