Coarse and Fine-Grained Hostility Detection in Hindi Posts using Fine Tuned Multilingual Embeddings

01/13/2021
by   Arkadipta De, et al.
0

Due to the wide adoption of social media platforms like Facebook, Twitter, etc., there is an emerging need of detecting online posts that can go against the community acceptance standards. The hostility detection task has been well explored for resource-rich languages like English, but is unexplored for resource-constrained languages like Hindidue to the unavailability of large suitable data. We view this hostility detection as a multi-label multi-class classification problem. We propose an effective neural network-based technique for hostility detection in Hindi posts. We leverage pre-trained multilingual Bidirectional Encoder Representations of Transformer (mBERT) to obtain the contextual representations of Hindi posts. We have performed extensive experiments including different pre-processing techniques, pre-trained models, neural architectures, hybrid strategies, etc. Our best performing neural classifier model includes One-vs-the-Rest approach where we obtained 92.60 81.14 and defamation labels respectively. The proposed model outperformed the existing baseline models and emerged as the state-of-the-art model for detecting hostility in the Hindi posts.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/14/2021

Hostility Detection in Hindi leveraging Pre-Trained Language Models

Hostile content on social platforms is ever increasing. This has led to ...
research
01/15/2021

Walk in Wild: An Ensemble Approach for Hostility Detection in Hindi Posts

As the reach of the internet increases, pejorative terms started floodin...
research
01/10/2021

Detecting Hostile Posts using Relational Graph Convolutional Network

This work is based on the submission to the competition Hindi Constraint...
research
01/11/2021

Evaluation of Deep Learning Models for Hostility Detection in Hindi Text

The social media platform is a convenient medium to express personal tho...
research
04/16/2019

An Empirical Evaluation of Text Representation Schemes on Multilingual Social Web to Filter the Textual Aggression

This paper attempt to study the effectiveness of text representation sch...
research
06/08/2022

Improved two-stage hate speech classification for twitter based on Deep Neural Networks

Hate speech is a form of online harassment that involves the use of abus...
research
10/26/2022

A Transformer-based Framework for POI-level Social Post Geolocation

POI-level geo-information of social posts is critical to many location-b...

Please sign up or login with your details

Forgot password? Click here to reset