Multilingual HateCheck: Functional Tests for Multilingual Hate Speech Detection Models

06/20/2022
by   Paul Röttger, et al.
6

Hate speech detection models are typically evaluated on held-out test sets. However, this risks painting an incomplete and potentially misleading picture of model performance because of increasingly well-documented systematic gaps and biases in hate speech datasets. To enable more targeted diagnostic insights, recent research has thus introduced functional tests for hate speech detection models. However, these tests currently only exist for English-language content, which means that they cannot support the development of more effective models in other languages spoken by billions across the world. To help address this issue, we introduce Multilingual HateCheck (MHC), a suite of functional tests for multilingual hate speech detection models. MHC covers 34 functionalities across ten languages, which is more languages than any other hate speech dataset. To illustrate MHC's utility, we train and test a high-performing multilingual hate speech detection model, and reveal critical model weaknesses for monolingual and cross-lingual applications.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/31/2020

HateCheck: Functional Tests for Hate Speech Detection Models

Detecting online hate is a difficult task that even state-of-the-art mod...
research
04/09/2018

Vision as an Interlingua: Learning Multilingual Semantic Embeddings of Untranscribed Speech

In this paper, we explore the learning of neural network embeddings for ...
research
04/08/2022

Checking HateCheck: a cross-functional analysis of behaviour-aware learning for hate speech detection

Behavioural testing – verifying system capabilities by validating human-...
research
11/23/2020

An Online Multilingual Hate speech Recognition System

The exponential increase in the use of the Internet and social media ove...
research
01/27/2022

Highly Generalizable Models for Multilingual Hate Speech Detection

Hate speech detection has become an important research topic within the ...
research
05/22/2023

Evaluating ChatGPT's Performance for Multilingual and Emoji-based Hate Speech Detection

Hate speech is a severe issue that affects many online platforms. So far...
research
08/04/2023

Adapting the NICT-JLE Corpus for Disfluency Detection Models

The detection of disfluencies such as hesitations, repetitions and false...

Please sign up or login with your details

Forgot password? Click here to reset