Generating Counter Narratives against Online Hate Speech: Data and Strategies

04/08/2020
by Serra Sinem Tekiroglu, et al.
Recently, research has started to focus on avoiding the undesired effects of content moderation, such as censorship and overblocking, when dealing with online hatred. The core idea is to intervene directly in the discussion with textual responses meant to counter the hate content and prevent it from spreading further. Accordingly, automation strategies such as natural language generation are beginning to be investigated. Still, they suffer from a lack of sufficient quality data and tend to produce generic or repetitive responses. Aware of these limitations, we present a study on how to collect responses to hate effectively, employing large-scale unsupervised language models such as GPT-2 to generate silver data, and on the best annotation strategies and neural architectures for filtering that data before expert validation/post-editing.

research
06/22/2021

Towards Knowledge-Grounded Counter Narrative Generation for Hate Speech

Tackling online hatred using informed textual responses - called counter...
research
10/08/2019

CONAN – COunter NArratives through Nichesourcing: a Multilingual Dataset of Responses to Fight Online Hate Speech

Although there is an unprecedented effort to provide adequate responses ...
research
04/04/2022

Using Pre-Trained Language Models for Producing Counter Narratives Against Hate Speech: a Comparative Study

In this work, we present an extensive study on the use of pre-trained la...
research
03/11/2023

Reinforcement Learning-based Counter-Misinformation Response Generation: A Case Study of COVID-19 Vaccine Misinformation

The spread of online misinformation threatens public health, democracy, ...
research
07/01/2023

Understanding Counterspeech for Online Harm Mitigation

Counterspeech offers direct rebuttals to hateful speech by challenging p...
research
04/29/2022

Handling and Presenting Harmful Text

Textual data can pose a risk of serious harm. These harms can be categor...
research
07/30/2022

ELF22: A Context-based Counter Trolling Dataset to Combat Internet Trolls

Online trolls increase social costs and cause psychological damage to in...
