BanglaSarc: A Dataset for Sarcasm Detection

09/27/2022
by   Tasnim Sakib Apon, et al.
0

Being one of the most widely spoken language in the world, the use of Bangla has been increasing in the world of social media as well. Sarcasm is a positive statement or remark with an underlying negative motivation that is extensively employed in today's social media platforms. There has been a significant improvement in sarcasm detection in English over the previous many years, however the situation regarding Bangla sarcasm detection remains unchanged. As a result, it is still difficult to identify sarcasm in bangla, and a lack of high-quality data is a major contributing factor. This article proposes BanglaSarc, a dataset constructed specifically for bangla textual data sarcasm detection. This dataset contains of 5112 comments/status and contents collected from various online social platforms such as Facebook, YouTube, along with a few online blogs. Due to the limited amount of data collection of categorized comments in Bengali, this dataset will aid in the of study identifying sarcasm, recognizing people's emotion, detecting various types of Bengali expressions, and other domains. The dataset is publicly available at https://www.kaggle.com/datasets/sakibapon/banglasarc.

READ FULL TEXT

page 3

page 4

page 5

research
02/04/2021

Bangla Text Dataset and Exploratory Analysis for Online Harassment Detection

Being the seventh most spoken language in the world, the use of the Bang...
research
03/22/2023

Interpretable Bangla Sarcasm Detection using BERT and Explainable AI

A positive phrase or a sentence with an underlying negative motive is us...
research
06/11/2020

ETHOS: an Online Hate Speech Detection Dataset

Online hate speech is a newborn problem in our modern society which is g...
research
11/16/2018

Improving Rotated Text Detection with Rotation Region Proposal Networks

A significant number of images shared on social media platforms such as ...
research
05/24/2021

Abusive Language Detection in Heterogeneous Contexts: Dataset Collection and the Role of Supervised Attention

Abusive language is a massive problem in online social platforms. Existi...
research
05/23/2022

Towards automatic detection of wildlife trade using machine vision models

Unsustainable trade in wildlife is one of the major threats affecting th...
research
06/08/2021

Cyberbullying Detection Using Deep Neural Network from Social Media Comments in Bangla Language

Cyberbullying or Online harassment detection on social media for various...

Please sign up or login with your details

Forgot password? Click here to reset