Improving Automatic Hate Speech Detection with Multiword Expression Features

06/01/2021
by   Nicolas Zampieri, et al.
0

The task of automatically detecting hate speech in social media is gaining more and more attention. Given the enormous volume of content posted daily, human monitoring of hate speech is unfeasible. In this work, we propose new word-level features for automatic hate speech detection (HSD): multiword expressions (MWEs). MWEs are lexical units greater than a word that have idiomatic and compositional meanings. We propose to integrate MWE features in a deep neural network-based HSD framework. Our baseline HSD system relies on Universal Sentence Encoder (USE). To incorporate MWE features, we create a three-branch deep neural network: one branch for USE, one for MWE categories, and one for MWE embeddings. We conduct experiments on two hate speech tweet corpora with different MWE categories and with two types of MWE embeddings, word2vec and BERT. Our experiments demonstrate that the proposed HSD system with MWE features significantly outperforms the baseline system in terms of macro-F1.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/25/2020

CRAB: Class Representation Attentive BERT for Hate Speech Identification in Social Media

In recent years, social media platforms have hosted an explosion of hate...
research
06/29/2021

Hate speech detection using static BERT embeddings

With increasing popularity of social media platforms hate speech is emer...
research
08/28/2018

All You Need is "Love": Evading Hate-speech Detection

With the spread of social networks and their unfortunate use for hate sp...
research
02/27/2018

Hate Speech Detection: A Solved Problem? The Challenging Case of Long Tail on Twitter

In recent years, the increasing propagation of hate speech on social med...
research
12/18/2017

Detecting Hate Speech in Social Media

In this paper we examine methods to detect hate speech in social media, ...
research
02/08/2021

A study of text representations in Hate Speech Detection

The pervasiveness of the Internet and social media have enabled the rapi...
research
03/29/2017

Automatic Argumentative-Zoning Using Word2vec

In comparison with document summarization on the articles from social me...

Please sign up or login with your details

Forgot password? Click here to reset