A Unified Deep Learning Architecture for Abuse Detection

02/01/2018
by   Antigoni-Maria Founta, et al.
0

Hate speech, offensive language, sexism, racism and other types of abusive behavior have become a common phenomenon in many online social media platforms. In recent years, such diverse abusive behaviors have been manifesting with increased frequency and levels of intensity. This is due to the openness and willingness of popular media platforms, such as Twitter and Facebook, to host content of sensitive or controversial topics. However, these platforms have not adequately addressed the problem of online abusive behavior, and their responsiveness to the effective detection and blocking of such inappropriate behavior remains limited. In the present paper, we study this complex problem by following a more holistic approach, which considers the various aspects of abusive behavior. To make the approach tangible, we focus on Twitter data and analyze user and textual properties from different angles of abusive posting behavior. We propose a deep learning architecture, which utilizes a wide variety of available metadata, and combines it with automatically-extracted hidden patterns within the text of the tweets, to detect multiple abusive behavioral norms which are highly inter-related. We apply this unified architecture in a seamless, transparent fashion to detect different types of abusive behavior (hate speech, sexism vs. racism, bullying, sarcasm, etc.) without the need for any tuning of the model architecture for each task. We test the proposed approach with multiple datasets addressing different and multiple abusive behaviors on Twitter. Our results demonstrate that it largely outperforms the state-of-art methods (between 21 and 45% improvement in AUC, depending on the dataset).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/26/2020

Abusive Language Detection and Characterization of Twitter Behavior

In this work, abusive language detection in online content is performed ...
research
06/05/2020

"To Target or Not to Target": Identification and Analysis of Abusive Text Using Ensemble of Classifiers

With rising concern around abusive and hateful behavior on social media ...
research
10/08/2020

Detect All Abuse! Toward Universal Abusive Language Detection Models

Online abusive language detection (ALD) has become a societal issue of i...
research
06/30/2020

I call BS: Fraud Detection in Crowdfunding Campaigns

Donations to charity-based crowdfunding environments have been on the ri...
research
12/04/2021

Unraveling Social Perceptions Behaviors towards Migrants on Twitter

We draw insights from the social psychology literature to identify two f...
research
06/11/2020

Detection of Novel Social Bots by Ensembles of Specialized Classifiers

Malicious actors create inauthentic social media accounts controlled in ...
research
09/23/2018

Detecting Hate Speech and Offensive Language on Twitter using Machine Learning: An N-gram and TFIDF based Approach

Toxic online content has become a major issue in today's world due to an...

Please sign up or login with your details

Forgot password? Click here to reset