Interpretable Multi Labeled Bengali Toxic Comments Classification using Deep Learning

04/08/2023
by   Tanveer Ahmed Belal, et al.

This paper presents a deep learning-based pipeline for categorizing Bengali toxic comments. First, a binary classification model determines whether a comment is toxic; then a multi-label classifier determines which toxicity types the comment belongs to. For this purpose, we prepared a manually labeled dataset of 16,073 instances, of which 8,488 are toxic. Any toxic comment may belong to one or more of six toxic categories simultaneously: vulgar, hate, religious, threat, troll, and insult. For the binary task, Long Short Term Memory (LSTM) with BERT embedding achieved 89.42% accuracy. As a multi-label classifier, a combination of Convolutional Neural Network and Bi-directional Long Short Term Memory (CNN-BiLSTM) with an attention mechanism achieved 78.92% accuracy. To explain the predictions and interpret word feature importance during classification by the proposed models, we used the Local Interpretable Model-Agnostic Explanations (LIME) framework. Our dataset is public and can be accessed at https://github.com/deepu099cse/Multi-Labeled-Bengali-Toxic-Comments-Classification
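
The two-stage flow described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function `classify_comment`, the 0.5 threshold, and the keyword-based `toy_binary` and `toy_multilabel` stand-ins are all assumptions introduced here. In the paper, the first stage is an LSTM with BERT embedding and the second is a CNN-BiLSTM with attention.

```python
from typing import Callable, Dict, List

# The six toxicity categories named in the abstract.
CATEGORIES: List[str] = ["vulgar", "hate", "religious", "threat", "troll", "insult"]


def classify_comment(
    text: str,
    binary_model: Callable[[str], float],
    multilabel_model: Callable[[str], Dict[str, float]],
    threshold: float = 0.5,  # assumed cut-off, not taken from the paper
) -> Dict[str, object]:
    """Run stage 1 (toxic or not), then stage 2 (which types) only if toxic."""
    p_toxic = binary_model(text)
    if p_toxic < threshold:
        # Non-toxic comments never reach the multi-label classifier.
        return {"toxic": False, "labels": []}

    scores = multilabel_model(text)
    # Multi-label: every category is thresholded independently,
    # so a comment can receive several labels at once.
    labels = [c for c in CATEGORIES if scores.get(c, 0.0) >= threshold]
    return {"toxic": True, "labels": labels}


# Hypothetical stand-ins for the trained models, for demonstration only.
def toy_binary(text: str) -> float:
    return 0.9 if "bad" in text else 0.1


def toy_multilabel(text: str) -> Dict[str, float]:
    return {c: (0.8 if c in text else 0.2) for c in CATEGORIES}


if __name__ == "__main__":
    print(classify_comment("bad threat insult", toy_binary, toy_multilabel))
    print(classify_comment("hello there", toy_binary, toy_multilabel))
```

Gating the second stage on the first means the multi-label model only sees comments already judged toxic, which matches the ordering the abstract describes.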


research
03/03/2021

Malware Classification Using Long Short-Term Memory Models

Signature and anomaly based techniques are the quintessential approaches...
research
06/01/2020

A multimodal approach for multi-label movie genre classification

Movie genre classification is a challenging task that has increasingly a...
research
08/03/2021

Automatic classification of eclipsing binary stars using deep learning methods

In the last couple of decades, tremendous progress has been achieved in ...
research
11/26/2022

An Automatic SOAP Classification System Using Weakly Supervision And Transfer Learning

In this paper, we introduce a comprehensive framework for developing a m...
research
12/19/2016

Few-Shot Object Recognition from Machine-Labeled Web Images

With the tremendous advances of Convolutional Neural Networks (ConvNets)...
research
09/08/2018

Multi-label Classification of User Reactions in Online News

The increase in the number of Internet users and the strong interaction ...
