Hostility Detection in Hindi leveraging Pre-Trained Language Models

01/14/2021
by   Ojasv Kamal, et al.
0

Hostile content on social platforms is ever increasing. This has led to the need for proper detection of hostile posts so that appropriate action can be taken to tackle them. Though a lot of work has been done recently in the English Language to solve the problem of hostile content online, similar works in Indian Languages are quite hard to find. This paper presents a transfer learning based approach to classify social media (i.e Twitter, Facebook, etc.) posts in Hindi Devanagari script as Hostile or Non-Hostile. Hostile posts are further analyzed to determine if they are Hateful, Fake, Defamation, and Offensive. This paper harnesses attention based pre-trained models fine-tuned on Hindi data with Hostile-Non hostile task as Auxiliary and fusing its features for further sub-tasks classification. Through this approach, we establish a robust and consistent model without any ensembling or complex pre-processing. We have presented the results from our approach in CONSTRAINT-2021 Shared Task on hostile post detection where our model performs extremely well with 3rd runner up in terms of Weighted Fine-Grained F1 Score.

READ FULL TEXT
research
01/13/2021

Coarse and Fine-Grained Hostility Detection in Hindi Posts using Fine Tuned Multilingual Embeddings

Due to the wide adoption of social media platforms like Facebook, Twitte...
research
01/10/2021

Detecting Hostile Posts using Relational Graph Convolutional Network

This work is based on the submission to the competition Hindi Constraint...
research
04/16/2019

An Empirical Evaluation of Text Representation Schemes on Multilingual Social Web to Filter the Textual Aggression

This paper attempt to study the effectiveness of text representation sch...
research
07/12/2020

Fine-grained Language Identification with Multilingual CapsNet Model

Due to a drastic improvement in the quality of internet services worldwi...
research
02/05/2022

Multilingual Hate Speech and Offensive Content Detection using Modified Cross-entropy Loss

The number of increased social media users has led to a lot of people mi...
research
07/04/2022

Portuguese Man-of-War Image Classification with Convolutional Neural Networks

Portuguese man-of-war (PMW) is a gelatinous organism with long tentacles...
research
01/21/2022

Understanding and Detecting Hateful Content using Contrastive Learning

The spread of hate speech and hateful imagery on the Web is a significan...

Please sign up or login with your details

Forgot password? Click here to reset