Transfer Learning for Hate Speech Detection in Social Media

06/10/2019
by   Marian-Andrei Rizoiu, et al.

In today's society, more and more people are connected to the Internet, and its information and communication technologies have become an essential part of everyday life. Unfortunately, the flip side of this increased connectivity to social media and other online content is cyber-bullying and cyber-hatred, among other harmful and anti-social behaviors. Models based on machine learning and natural language processing provide a way to detect this hate speech in web text, in order to make discussion forums and other media and platforms safer. The main difficulty, however, is annotating a sufficiently large number of examples to train these models. In this paper, we report on developing automated text analytics methods capable of jointly learning a single representation of hate from several smaller, unrelated data sets. We train and test our methods on a total of 37,520 English tweets that have been annotated to differentiate harmless messages from racist or sexist content in the first detection task, and from hateful or offensive content in the second detection task. Our most sophisticated method combines a deep neural network architecture with transfer learning. It is capable of creating word and sentence embeddings that are specific to these tasks while also embedding the meaning of generic hate speech. It achieves a macro-averaged F1 of 78% and 72% on the first and second task, respectively. This method also enables generating an interpretable two-dimensional text visualization, called the Map of Hate, that is capable of separating different types of hate speech and explaining what makes text harmful. These methods and insights hold potential not only for safer social media, but also for reducing the need to expose human moderators and annotators to distressing online messaging.
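The abstract reports results as macro-averaged F1, which averages the per-class F1 scores with equal weight so that rare classes count as much as the dominant harmless class. The following is a minimal sketch of that metric in plain Python; the function name `macro_f1`, the label strings, and the toy predictions are illustrative assumptions and are not taken from the paper or its data.

```python
from collections import Counter


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 scores (hypothetical helper)."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted class gains a false positive
            fn[truth] += 1  # true class gains a false negative
    scores = []
    for label in labels:
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never occurs
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy, made-up labels: one "racism" example is misclassified as "none".
y_true = ["none", "none", "none", "none", "racism", "sexism"]
y_pred = ["none", "none", "none", "none", "none", "sexism"]

accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
print(f"accuracy = {accuracy:.3f}, macro-F1 = {macro_f1(y_true, y_pred):.3f}")
```

In this toy case the accuracy stays high because most examples are harmless, while the macro-averaged F1 drops sharply because the missed class contributes an F1 of zero. This is why the metric suits imbalanced tasks such as hate speech detection.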


