Hope Speech detection in under-resourced Kannada language

08/10/2021
by   Adeep Hande, et al.
0

Numerous methods have been developed to monitor the spread of negativity in modern years by eliminating vulgar, offensive, and fierce comments from social media platforms. However, there are relatively lesser amounts of study that converges on embracing positivity, reinforcing supportive and reassuring content in online forums. Consequently, we propose creating an English-Kannada Hope speech dataset, KanHope and comparing several experiments to benchmark the dataset. The dataset consists of 6,176 user-generated comments in code mixed Kannada scraped from YouTube and manually annotated as bearing hope speech or Not-hope speech. In addition, we introduce DC-BERT4HOPE, a dual-channel model that uses the English translation of KanHope for additional training to promote hope speech detection. The approach achieves a weighted F1-score of 0.756, bettering other models. Henceforth, KanHope aims to instigate research in Kannada while broadly promoting researchers to take a pragmatic approach towards online content that encourages, positive, and supportive.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/19/2021

IIITT@LT-EDI-EACL2021-Hope Speech Detection: There is always Hope in Transformers

In a world filled with serious challenges like climate change, religious...
research
10/05/2020

Gauravarora@HASOC-Dravidian-CodeMix-FIRE2020: Pre-training ULMFiT on Synthetically Generated Code-Mixed Data for Hate Speech Detection

This paper describes the system submitted to Dravidian-Codemix-HASOC2020...
research
02/28/2021

NLP-CUET@LT-EDI-EACL2021: Multilingual Code-Mixed Hope Speech Detection using Cross-lingual Representation Learner

In recent years, several systems have been developed to regulate the spr...
research
10/25/2022

PolyHope: Two-Level Hope Speech Detection from Tweets

Hope is characterized as openness of spirit toward the future, a desire,...
research
05/24/2021

Abusive Language Detection in Heterogeneous Contexts: Dataset Collection and the Role of Supervised Attention

Abusive language is a massive problem in online social platforms. Existi...
research
04/07/2022

Korean Online Hate Speech Dataset for Multilabel Classification: How Can Social Science Improve Dataset on Hate Speech?

We suggest a multilabel Korean online hate speech dataset that covers se...
research
06/11/2020

ETHOS: an Online Hate Speech Detection Dataset

Online hate speech is a newborn problem in our modern society which is g...

Please sign up or login with your details

Forgot password? Click here to reset