DeepHateExplainer: Explainable Hate Speech Detection in Under-resourced Bengali Language

12/28/2020
by   Md. Rezaul Karim, et al.
28

Exponential growths of social media and micro-blogging sites not only provide platforms for empowering freedom of expressions and individual voices, but also enables people to express anti-social behavior like online harassment, cyberbullying, and hate speech. Numerous works have been proposed to utilize these data for social and anti-social behavior analysis, by predicting the contexts mostly for highly-resourced languages like English. However, some languages such as Bengali are under-resourced that lack of computational resources for natural language processing(NLP). In this paper, we propose an explainable approach for hate speech detection from under-resourced Bengali language, which we called DeepHateExplainer. In our approach, Bengali texts are first comprehensively preprocessed, before classifying them into political, personal, geopolitical, and religious hates, by employing neural ensemble of different transformer-based neural architectures(i.e., monolingual Bangla BERT-base, multilingual BERT-cased and uncased, and XLM-RoBERTa), followed by identifying important terms with sensitivity analysis and layer-wise relevance propagation(LRP) to provide human-interpretable explanations. Evaluations against several machine learning (linear and tree-based models) and deep neural networks (i.e., CNN, Bi-LSTM, and Conv-LSTM with word embeddings) baselines yield F1 scores of 84 geopolitical, and religious hates, respectively, during 3-fold cross-validation tests.

READ FULL TEXT

page 4

page 6

page 8

research
04/11/2020

Classification Benchmarks for Under-resourced Bengali Language based on Multichannel Convolutional-LSTM Network

Exponential growths of social media and micro-blogging sites not only pr...
research
04/19/2022

Multimodal Hate Speech Detection from Bengali Memes and Texts

Numerous works have been proposed to employ machine learning (ML) and de...
research
01/11/2021

Evaluation of Deep Learning Models for Hostility Detection in Hindi Text

The social media platform is a convenient medium to express personal tho...
research
10/23/2021

Hate and Offensive Speech Detection in Hindi and Marathi

Sentiment analysis is the most basic NLP task to determine the polarity ...
research
06/26/2022

Explainable and High-Performance Hate and Offensive Speech Detection

The spread of information through social media platforms can create envi...
research
08/08/2021

Efficacy of BERT embeddings on predicting disaster from Twitter data

Social media like Twitter provide a common platform to share and communi...
research
10/10/2021

amsqr at SemEval-2020 Task 12: Offensive language detection using neural networks and anti-adversarial features

This paper describes a method and system to solve the problem of detecti...

Please sign up or login with your details

Forgot password? Click here to reset