A BERT-Based Transfer Learning Approach for Hate Speech Detection in Online Social Media

10/28/2019
by   Marzieh Mozafari, et al.

Hateful and toxic content generated by a portion of social media users is a rising phenomenon that has motivated researchers to dedicate substantial effort to the challenging task of identifying hateful content. We need not only an efficient automatic hate speech detection model based on advanced machine learning and natural language processing, but also a sufficiently large amount of annotated data to train such a model. The lack of sufficient labelled hate speech data, along with existing biases, has been the main issue in this domain of research. To address these needs, in this study we introduce a novel transfer learning approach based on an existing pre-trained language model called BERT (Bidirectional Encoder Representations from Transformers). More specifically, we investigate the ability of BERT to capture hateful context within social media content by using new fine-tuning methods based on transfer learning. To evaluate our proposed approach, we use two publicly available datasets that have been annotated for racism, sexism, hate, or offensive content on Twitter. The results show that our solution achieves considerable performance on these datasets in terms of precision and recall in comparison to existing approaches. Consequently, our model can capture some biases in the data annotation and collection process and can potentially lead us to a more accurate model.
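Since the abstract reports results in terms of precision and recall, a minimal sketch of how these two metrics can be computed per class may be useful. This is not the authors' evaluation code: the function name `per_class_precision_recall` and the toy labels are assumptions made purely for illustration, and the numbers below are not results from the paper.

```python
from collections import Counter


def per_class_precision_recall(y_true, y_pred):
    """Return {label: (precision, recall)} for every label seen in either list.

    Hypothetical helper for illustration only.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    true_positives = Counter()  # correct predictions, per label
    pred_count = Counter()      # how often each label was predicted
    true_count = Counter()      # how often each label is the gold label

    for gold, pred in zip(y_true, y_pred):
        true_count[gold] += 1
        pred_count[pred] += 1
        if gold == pred:
            true_positives[gold] += 1

    scores = {}
    for label in sorted(set(true_count) | set(pred_count)):
        # Precision: of everything predicted as `label`, the fraction that was right.
        precision = true_positives[label] / pred_count[label] if pred_count[label] else 0.0
        # Recall: of everything truly `label`, the fraction that was found.
        recall = true_positives[label] / true_count[label] if true_count[label] else 0.0
        scores[label] = (precision, recall)
    return scores


# Toy, made-up gold labels and predictions (not data from the paper).
y_true = ["racism", "sexism", "neither", "neither", "sexism", "racism"]
y_pred = ["racism", "neither", "neither", "neither", "sexism", "sexism"]
scores = per_class_precision_recall(y_true, y_pred)

for label, (precision, recall) in scores.items():
    print(f"{label}: precision={precision:.2f} recall={recall:.2f}")
```

Reporting both metrics per class matters for hate speech data, where the hateful classes are typically much rarer than the neutral one and a single accuracy figure can hide poor recall on them.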


