Leveraging cross-platform data to improve automated hate speech detection

02/09/2021
by   John D Gallacher, et al.
12

Hate speech is increasingly prevalent online, and its negative outcomes include increased prejudice, extremism, and even offline hate crime. Automatic detection of online hate speech can help us to better understand these impacts. However, while the field has recently progressed through advances in natural language processing, challenges still remain. In particular, most existing approaches for hate speech detection focus on a single social media platform in isolation. This limits both the use of these models and their validity, as the nature of language varies from platform to platform. Here we propose a new cross-platform approach to detect hate speech which leverages multiple datasets and classification models from different platforms and trains a superlearner that can combine existing and novel training data to improve detection and increase model applicability. We demonstrate how this approach outperforms existing models, and achieves good performance when tested on messages from novel social media platforms not included in the original training data.

READ FULL TEXT

page 7

page 12

page 14

page 31

research
04/01/2022

Cyberbullying detection across social media platforms via platform-aware adversarial encoding

Despite the increasing interest in cyberbullying detection, existing eff...
research
09/04/2023

Hateful Messages: A Conversational Data Set of Hate Speech produced by Adolescents on Discord

With the rise of social media, a rise of hateful content can be observed...
research
07/06/2021

Empowering NGOs in Countering Online Hate Messages

Studies on online hate speech have mostly focused on the automated detec...
research
11/11/2022

Cross-Platform and Cross-Domain Abusive Language Detection with Supervised Contrastive Learning

The prevalence of abusive language on different online platforms has bee...
research
07/04/2023

Robust Hate Speech Detection in Social Media: A Cross-Dataset Empirical Evaluation

The automatic detection of hate speech online is an active research area...
research
02/17/2021

Towards generalisable hate speech detection: a review on obstacles and solutions

Hate speech is one type of harmful online content which directly attacks...
research
04/09/2018

Leveraging Intra-User and Inter-User Representation Learning for Automated Hate Speech Detection

Hate speech detection is a critical, yet challenging problem in Natural ...

Please sign up or login with your details

Forgot password? Click here to reset