Log In Sign Up

CRAB: Class Representation Attentive BERT for Hate Speech Identification in Social Media

by   Sayyed M. Zahiri, et al.

In recent years, social media platforms have hosted an explosion of hate speech and objectionable content. The urgent need for effective automatic hate speech detection models have drawn remarkable investment from companies and researchers. Social media posts are generally short and their semantics could drastically be altered by even a single token. Thus, it is crucial for this task to learn context-aware input representations, and consider relevancy scores between input embeddings and class representations as an additional signal. To accommodate these needs, this paper introduces CRAB (Class Representation Attentive BERT), a neural model for detecting hate speech in social media. The model benefits from two semantic representations: (i) trainable token-wise and sentence-wise class representations, and (ii) contextualized input embeddings from state-of-the-art BERT encoder. To investigate effectiveness of CRAB, we train our model on Twitter data and compare it against strong baselines. Our results show that CRAB achieves 1.89 relative improved Macro-averaged F1 over state-of-the-art baseline. The results of this research open an opportunity for the future research on automated abusive behavior detection in social media


page 1

page 2

page 3

page 4


Contextual Hate Speech Detection in Code Mixed Text using Transformer Based Approaches

In the recent past, social media platforms have helped people in connect...

Hate Speech Detection: A Solved Problem? The Challenging Case of Long Tail on Twitter

In recent years, the increasing propagation of hate speech on social med...

Improving Automatic Hate Speech Detection with Multiword Expression Features

The task of automatically detecting hate speech in social media is gaini...

A study of text representations in Hate Speech Detection

The pervasiveness of the Internet and social media have enabled the rapi...

Automatic Detection of Sexist Statements Commonly Used at the Workplace

Detecting hate speech in the workplace is a unique classification task, ...

Listening to Affected Communities to Define Extreme Speech: Dataset and Experiments

Building on current work on multilingual hate speech (e.g., Ousidhoum et...

Learning Class-specific Word Representations for Early Detection of Hoaxes in Social Media

As people increasingly use social media as a source for news consumption...