Ceasing hate with MoH: Hate Speech Detection in Hindi-English Code-Switched Language

10/18/2021
by   Arushi Sharma, et al.

Social media has become a bedrock for people worldwide to voice their opinions. Anonymity gives users a greater sense of freedom, making it possible to disregard social etiquette online and attack others without facing severe consequences, which inevitably propagates hate speech. Current measures to sift online content and offset the spread of hatred do not go far enough. One contributing factor is the prevalence of regional languages on social media and the paucity of language-flexible hate speech detectors. The proposed work focuses on analyzing hate speech in Hindi-English code-switched language. Our method explores transformation techniques to capture a precise text representation. To retain the structure of the data and yet use it with existing algorithms, we developed MoH, or Map Only Hindi; "moh" means "love" in Hindi. The MoH pipeline consists of language identification followed by Roman-to-Devanagari Hindi transliteration using a knowledge base of Roman Hindi words. Finally, it employs fine-tuned Multilingual BERT and MuRIL language models. We conducted several quantitative experiments on three datasets and evaluated performance using Precision, Recall, and F1 metrics. The first experiment studies the performance of MoH-mapped text with classical machine learning models and shows an average increase of 13%. The second compares the proposed work's scores with those of the baseline models and shows a rise in performance of 6%. The third examines the MoH technique with various data simulations using the existing transliteration library. Here, MoH outperforms the rest by 15%. The results show a significant improvement in the state-of-the-art scores on all three datasets.
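As a rough illustration of the mapping step described in the abstract, the sketch below replaces Roman-script Hindi words with their Devanagari forms and leaves all other text untouched. It is a minimal sketch under stated assumptions, not the authors' implementation: the four-entry dictionary and the function name `map_only_hindi` are hypothetical, and language identification is reduced to a lookup in that dictionary.

```python
import re

# Hypothetical toy knowledge base (Roman-script Hindi -> Devanagari).
# The knowledge base used in the paper is not reproduced here.
ROMAN_HINDI_TO_DEVANAGARI = {
    "pyaar": "प्यार",
    "nahi": "नहीं",
    "bahut": "बहुत",
    "accha": "अच्छा",
}

# Matches runs of ASCII letters, i.e. candidate Roman-script words.
ROMAN_WORD = re.compile(r"[A-Za-z]+")


def map_only_hindi(text, knowledge_base=ROMAN_HINDI_TO_DEVANAGARI):
    """Map only the words found in the knowledge base to Devanagari.

    Assumption: a word counts as Hindi if its lowercased form is a key in
    the knowledge base. Everything else (English words, digits,
    punctuation, spacing) is returned unchanged.
    """
    def replace(match):
        word = match.group(0)
        return knowledge_base.get(word.lower(), word)

    return ROMAN_WORD.sub(replace, text)


if __name__ == "__main__":
    print(map_only_hindi("This movie is bahut accha!"))
```

In a full pipeline, the mapped text would then be passed to a fine-tuned multilingual model for classification; that step is not shown here.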

