Detect All Abuse! Toward Universal Abusive Language Detection Models

10/08/2020
by   Kunze Wang, et al.
0

Online abusive language detection (ALD) has become a societal issue of increasing importance in recent years. Several previous works in online ALD focused on solving a single abusive language problem in a single domain, like Twitter, and have not been successfully transferable to the general ALD task or domain. In this paper, we introduce a new generic ALD framework, MACAS, which is capable of addressing several types of ALD tasks across different domains. Our generic framework covers multi-aspect abusive language embeddings that represent the target and content aspects of abusive language and applies a textual graph embedding that analyses the user's linguistic behaviour. Then, we propose and use the cross-attention gate flow mechanism to embrace multiple aspects of abusive language. Quantitative and qualitative evaluation results show that our ALD algorithm rivals or exceeds the six state-of-the-art ALD algorithms across seven ALD datasets covering multiple aspects of abusive language and different online community domains.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/02/2018

Neural Character-based Composition Models for Abuse Detection

The advent of social media in recent years has fed into some highly unde...
research
02/01/2018

A Unified Deep Learning Architecture for Abuse Detection

Hate speech, offensive language, sexism, racism and other types of abusi...
research
09/06/2019

Attending the Emotions to Detect Online Abusive Language

In recent years, abusive behavior has become a serious issue in online s...
research
05/28/2020

Joint Modelling of Emotion and Abusive Language Detection

The rise of online communication platforms has been accompanied by some ...
research
06/02/2021

Figurative Language in Recognizing Textual Entailment

We introduce a collection of recognizing textual entailment (RTE) datase...
research
07/11/2022

Learning Large-scale Universal User Representation with Sparse Mixture of Experts

Learning user sequence behaviour embedding is very sophisticated and cha...
research
03/08/2019

An Identification of Learners' Confusion through Language and Discourse Analysis

The substantial growth of online learning, in particular, Massively Open...

Please sign up or login with your details

Forgot password? Click here to reset