Improved two-stage hate speech classification for twitter based on Deep Neural Networks

06/08/2022
by   Georgios K. Pitsilis, et al.
0

Hate speech is a form of online harassment that involves the use of abusive language, and it is commonly seen in social media posts. This sort of harassment mainly focuses on specific group characteristics such as religion, gender, ethnicity, etc and it has both societal and economic consequences nowadays. The automatic detection of abusive language in text postings has always been a difficult task, but it is lately receiving much interest from the scientific community. This paper addresses the important problem of discerning hateful content in social media. The model we propose in this work is an extension of an existing approach based on LSTM neural network architectures, which we appropriately enhanced and fine-tuned to detect certain forms of hatred language, such as racism or sexism, in a short text. The most significant enhancement is the conversion to a two-stage scheme consisting of Recurrent Neural Network (RNN) classifiers. The output of all One-vs-Rest (OvR) classifiers from the first stage are combined and used to train the second stage classifier, which finally determines the type of harassment. Our study includes a performance comparison of several proposed alternative methods for the second stage evaluated on a public corpus of 16k tweets, followed by a generalization study on another dataset. The reported results show the superior classification quality of the proposed scheme in the task of hate speech detection as compared to the current state-of-the-art.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/13/2018

Detecting Offensive Language in Tweets Using Deep Learning

This paper addresses the important problem of discerning hateful content...
research
09/26/2020

Abusive Language Detection and Characterization of Twitter Behavior

In this work, abusive language detection in online content is performed ...
research
03/06/2022

Enhanced Sentiment Extraction Architecture for Social Media Content Analysis Using Capsule Networks

Recent research has produced efficient algorithms based on deep learning...
research
04/13/2020

Gender Detection on Social Networks using Ensemble Deep Learning

Analyzing the ever-increasing volume of posts on social media sites such...
research
04/16/2019

UTFPR at SemEval-2019 Task 5: Hate Speech Identification with Recurrent Neural Networks

In this paper we revisit the problem of automatically identifying hate s...
research
10/15/2019

Language Identification on Massive Datasets of Short Message using an Attention Mechanism CNN

Language Identification (LID) is a challenging task, especially when the...
research
01/13/2021

Coarse and Fine-Grained Hostility Detection in Hindi Posts using Fine Tuned Multilingual Embeddings

Due to the wide adoption of social media platforms like Facebook, Twitte...

Please sign up or login with your details

Forgot password? Click here to reset