Hate Speech and Counter Speech Detection: Conversational Context Does Matter

06/13/2022
by   Xinchen Yu, et al.

Hate speech is plaguing cyberspace along with user-generated content. This paper investigates the role of conversational context in the annotation and detection of online hate and counter speech, where context is defined as the preceding comment in a conversation thread. We created a context-aware dataset for a 3-way classification task on Reddit comments: hate speech, counter speech, or neutral. Our analyses indicate that context is critical for identifying hate and counter speech: human judgments change for most comments depending on whether we show annotators the context. A linguistic analysis offers insights into the language people use to express hate and counter speech. Experimental results show that neural networks obtain significantly better results if context is taken into account. We also present qualitative error analyses shedding light on (a) when and why context is beneficial and (b) the remaining errors made by our best model when context is taken into account.
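
The setup described in the abstract, where context is the preceding comment and each target comment gets one of three labels, can be illustrated with a minimal sketch. This is not the authors' implementation: the `Comment` class, the `build_input` helper, and the `[SEP]` separator string are all hypothetical names chosen for illustration, and the abstract does not say how context is actually fed to the models.

```python
from dataclasses import dataclass
from typing import Optional

# The three classes named in the abstract.
LABELS = ("hate", "counter", "neutral")


@dataclass
class Comment:
    """A target comment plus its context (hypothetical structure)."""
    text: str
    parent: Optional[str] = None  # the preceding comment in the thread, if any


def build_input(comment: Comment, use_context: bool = True,
                sep: str = " [SEP] ") -> str:
    """Return the string a classifier would see.

    With context, the preceding comment is prepended to the target;
    without context (or when there is no parent), only the target is used.
    The separator is an assumption, not taken from the paper.
    """
    if use_context and comment.parent:
        return comment.parent + sep + comment.text
    return comment.text


if __name__ == "__main__":
    c = Comment(text="target comment", parent="preceding comment")
    print(build_input(c))                     # context-aware input
    print(build_input(c, use_context=False))  # context-unaware input
```

Comparing a classifier trained on the two variants of the input is one simple way to measure how much the preceding comment helps.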

research
09/16/2020

Impact and dynamics of hate and counter speech online

Citizen-generated counter speech is a promising way to fight hate speech...
research
04/08/2019

Disfluencies and Human Speech Transcription Errors

This paper explores contexts associated with errors in transcription of...
research
05/28/2017

Understanding Abuse: A Typology of Abusive Language Detection Subtasks

As the body of research on abusive language detection and analysis grows...
research
01/31/2019

Conversational Networks for Automatic Online Moderation

Moderation of user-generated content in an online community is a challen...
research
10/20/2017

Detecting Online Hate Speech Using Context Aware Models

In the wake of a polarizing election, the cyber world is laden with hate...
research
11/11/2022

CoRAL: a Context-aware Croatian Abusive Language Dataset

In light of unprecedented increases in the popularity of the internet an...
research
04/07/2022

Korean Online Hate Speech Dataset for Multilabel Classification: How Can Social Science Improve Dataset on Hate Speech?

We suggest a multilabel Korean online hate speech dataset that covers se...
