Identifying and Categorizing Offensive Language in Social Media

04/10/2021
by Nikhil Oswal, et al.
Offensive language is pervasive in social media. Individuals frequently take advantage of the perceived anonymity of computer-mediated communication to engage in behavior that many of them would not consider in real life. The automatic identification of offensive content online is an important task that has gained attention in recent years. It can be modeled as a supervised classification problem in which systems are trained on a dataset of posts annotated for the presence of some form(s) of abusive or offensive content. This study describes a classification system built for SemEval-2019 Task 6: OffensEval. The system classifies a tweet as either offensive or not offensive (Sub-task A) and further classifies offensive tweets into categories (Sub-tasks B & C). We trained machine learning and deep learning models, combined with data preprocessing and sampling techniques, to obtain the best results. The models discussed include Naive Bayes, SVM, Logistic Regression, Random Forest, and LSTM.
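
To make the supervised framing of Sub-task A concrete, here is a minimal sketch of a multinomial Naive Bayes classifier, one of the model families named above. It is not the authors' system. The whitespace tokenizer, the add-one smoothing, the function names, and the six toy posts are all assumptions made for illustration, and the labels OFF and NOT are used here simply as offensive / not offensive tags.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and split on whitespace (a stand-in for real tweet preprocessing)."""
    return text.lower().split()


def train_naive_bayes(examples):
    """Collect per-label post counts and word counts from (text, label) pairs."""
    label_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in examples:
        label_counts[label] += 1
        for token in tokenize(text):
            word_counts[label][token] += 1
            vocab.add(token)
    return label_counts, word_counts, vocab


def predict(model, text):
    """Return the label with the highest log-probability under add-one smoothing."""
    label_counts, word_counts, vocab = model
    total_posts = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label in label_counts:
        # Log prior: share of training posts carrying this label.
        score = math.log(label_counts[label] / total_posts)
        denominator = sum(word_counts[label].values()) + len(vocab)
        for token in tokenize(text):
            if token in vocab:  # words never seen in training are ignored
                score += math.log((word_counts[label][token] + 1) / denominator)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical toy data, invented for this sketch (not drawn from the task dataset).
TOY_DATA = [
    ("you are a stupid idiot", "OFF"),
    ("shut up you idiot", "OFF"),
    ("what a stupid loser", "OFF"),
    ("have a great day", "NOT"),
    ("what a lovely morning", "NOT"),
    ("you are a great friend", "NOT"),
]

model = train_naive_bayes(TOY_DATA)
print(predict(model, "you stupid idiot"))
print(predict(model, "have a lovely day"))
```

On a real annotated dataset, the same train-then-predict shape would apply, with proper preprocessing, class-imbalance handling, and held-out evaluation replacing the toy pieces.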

