Sexism Identification in Tweets and Gabs using Deep Neural Networks

11/05/2021
by   Amikul Kalra, et al.
0

Through anonymisation and accessibility, social media platforms have facilitated the proliferation of hate speech, prompting increased research in developing automatic methods to identify these texts. This paper explores the classification of sexism in text using a variety of deep neural network model architectures such as Long-Short-Term Memory (LSTMs) and Convolutional Neural Networks (CNNs). These networks are used in conjunction with transfer learning in the form of Bidirectional Encoder Representations from Transformers (BERT) and DistilBERT models, along with data augmentation, to perform binary and multiclass sexism classification on the dataset of tweets and gabs from the sEXism Identification in Social neTworks (EXIST) task in IberLEF 2021. The models are seen to perform comparatively to those from the competition, with the best performances seen using BERT and a multi-filter CNN model. Data augmentation further improves these results for the multi-class classification task. This paper also explores the errors made by the models and discusses the difficulty in automatically classifying sexism due to the subjectivity of the labels and the complexity of natural language used in social media.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/09/2019

Hate Speech Detection on Vietnamese Social Media Text using the Bidirectional-LSTM Model

In this paper, we describe our system which participates in the shared t...
research
12/20/2022

A Twitter BERT Approach for Offensive Language Detection in Marathi

Automated offensive language detection is essential in combating the spr...
research
09/18/2020

NEU at WNUT-2020 Task 2: Data Augmentation To Tell BERT That Death Is Not Necessarily Informative

Millions of people around the world are sharing COVID-19 related informa...
research
04/01/2019

Cyberthreat Detection from Twitter using Deep Neural Networks

To be prepared against cyberattacks, most organizations resort to securi...
research
10/07/2020

Combining Deep Learning and String Kernels for the Localization of Swiss German Tweets

In this work, we introduce the methods proposed by the UnibucKernel team...
research
12/05/2018

Text Data Augmentation Made Simple By Leveraging NLP Cloud APIs

In practice, it is common to find oneself with far too little text data ...
research
07/28/2020

YNU-HPCC at SemEval-2020 Task 8: Using a Parallel-Channel Model for Memotion Analysis

In recent years, the growing ubiquity of Internet memes on social media ...

Please sign up or login with your details

Forgot password? Click here to reset