Hate Speech Detection: A Solved Problem? The Challenging Case of Long Tail on Twitter

02/27/2018
by   Ziqi Zhang, et al.
0

In recent years, the increasing propagation of hate speech on social media and the urgent need for effective counter-measures have drawn significant investment from governments, companies, and empirical research. Despite a large number of emerging, scientific studies to address the problem, the performance of existing automated methods at identifying specific types of hate speech - as opposed to identifying non-hate -is still very unsatisfactory, and the reasons behind are poorly understood. This work undertakes the first in-depth analysis towards this problem and shows that, the very challenging nature of identifying hate speech on the social media is largely due to the extremely unbalanced presence of real hateful content in the typical datasets, and the lack of unique, discriminative features in such content, both causing them to reside in the 'long tail' of a dataset that is difficult to discover. To address this issue, we propose novel Deep Neural Network structures serving as effective feature extractors, and explore the usage of background information in the form of different word embeddings pre-trained from unlabelled corpora. We empirically evaluate our methods on the largest collection of hate speech datasets based on Twitter, and show that our methods can significantly outperform state of the art, as they are able to obtain a maximum improvement of between 4 and 16 percentage points (macro-average F1) depending on datasets.

READ FULL TEXT
research
10/25/2020

CRAB: Class Representation Attentive BERT for Hate Speech Identification in Social Media

In recent years, social media platforms have hosted an explosion of hate...
research
01/08/2021

Leveraging Multilingual Transformers for Hate Speech Detection

Detecting and classifying instances of hate in social media text has bee...
research
09/27/2018

Predictive Embeddings for Hate Speech Detection on Twitter

We present a neural-network based approach to classifying online hate sp...
research
12/12/2018

Detecting weak and strong Islamophobic hate speech on social media

Islamophobic hate speech on social media inflicts considerable harm on b...
research
02/14/2019

Author Profiling for Hate Speech Detection

The rapid growth of social media in recent years has fed into some highl...
research
06/01/2021

Improving Automatic Hate Speech Detection with Multiword Expression Features

The task of automatically detecting hate speech in social media is gaini...
research
03/16/2021

dictNN: A Dictionary-Enhanced CNN Approach for Classifying Hate Speech on Twitter

Hate speech on social media is a growing concern, and automated methods ...

Please sign up or login with your details

Forgot password? Click here to reset