Hate Speech Criteria: A Modular Approach to Task-Specific Hate Speech Definitions

06/30/2022
by Urja Khurana, et al.

Offensive Content Warning: This paper contains offensive language, used only to provide examples that clarify the research; these examples do not reflect the authors' opinions. Please be aware that they are offensive and may cause you distress.

The subjectivity of recognizing hate speech makes it a complex task, and this complexity is reflected in the differing and incomplete definitions used in NLP. We present hate speech criteria, developed with perspectives from law and social science, that aim to help researchers create more precise definitions and annotation guidelines along five aspects: (1) target groups, (2) dominance, (3) perpetrator characteristics, (4) type of negative group reference, and (5) type of potential consequences/effects. Definitions can be structured to cover a broader or a narrower phenomenon, so researchers can make a conscious choice to specify a criterion or leave it open. We argue that the goal and the exact task developers have in mind should determine how the scope of hate speech is defined. We also provide an overview of the properties of English datasets from <hatespeechdata.com> that may help in selecting the most suitable dataset for a specific scenario.
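One way to picture the modular idea is as a record with one slot per criterion, where an empty slot means the criterion is deliberately left open. The sketch below is only an illustration under that assumption: the class name, field names, helper methods, and example values are all hypothetical and do not come from the paper.

```python
from dataclasses import dataclass, fields
from typing import FrozenSet, List, Optional


@dataclass(frozen=True)
class HateSpeechDefinition:
    """Hypothetical container for the five criteria; None means 'left open'."""

    target_groups: Optional[FrozenSet[str]] = None
    dominance: Optional[str] = None
    perpetrator_characteristics: Optional[str] = None
    negative_group_reference: Optional[FrozenSet[str]] = None
    potential_consequences: Optional[FrozenSet[str]] = None

    def specified_criteria(self) -> List[str]:
        # Criteria the task designer chose to pin down, in declaration order.
        return [f.name for f in fields(self) if getattr(self, f.name) is not None]

    def open_criteria(self) -> List[str]:
        # Criteria left unspecified, which keeps the definition broader.
        return [f.name for f in fields(self) if getattr(self, f.name) is None]


# Illustrative values only; they are assumptions, not categories from the paper.
narrow = HateSpeechDefinition(
    target_groups=frozenset({"religion"}),
    potential_consequences=frozenset({"incitement to violence"}),
)
broad = HateSpeechDefinition()  # every criterion left open

print(narrow.specified_criteria())
print(broad.open_criteria())
```

Listing the open criteria next to the specified ones makes the scope of a definition explicit, which is the kind of conscious choice the abstract argues for.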

research
06/30/2021

Whose Opinions Matter? Perspective-aware Models to Identify Opinions of Hate Speech Victims in Abusive Language Detection

Social media platforms provide users the freedom of expression and a med...
research
05/24/2021

Towards Standard Criteria for human evaluation of Chatbots: A Survey

Human evaluation is becoming a necessity to test the performance of Chat...
research
08/31/2023

CReHate: Cross-cultural Re-annotation of English Hate Speech Dataset

English datasets predominantly reflect the perspectives of certain natio...
research
08/11/2021

Overview of the TREC 2020 Fair Ranking Track

This paper provides an overview of the NIST TREC 2020 Fair Ranking track...
research
11/03/2020

Treebanking User-Generated Content: a UD Based Overview of Guidelines, Corpora and Unified Recommendations

This article presents a discussion on the main linguistic phenomena whic...
research
01/27/2017

Measuring the Reliability of Hate Speech Annotations: The Case of the European Refugee Crisis

Some users of social media are spreading racist, sexist, and otherwise h...
