L3CubeMahaSent: A Marathi Tweet-based Sentiment Analysis Dataset

03/21/2021
by Atharva Kulkarni, et al.

Sentiment analysis is one of the most fundamental tasks in Natural Language Processing. Widely spoken languages such as English, Arabic, Russian, and Mandarin, as well as Indian languages such as Hindi, Bengali, and Tamil, have seen a significant amount of work in this area. However, Marathi, the third most popular language in India, still lags behind due to the absence of proper datasets. In this paper, we present the first major publicly available Marathi sentiment analysis dataset, L3CubeMahaSent. It is curated from tweets extracted from the Twitter accounts of various Maharashtrian personalities. Our dataset consists of 16,000 distinct tweets classified into three broad classes: positive, negative, and neutral. We also present the guidelines we used to annotate the tweets. Finally, we present the statistics of our dataset and baseline classification results using CNN, LSTM, ULMFiT, and BERT-based deep learning models.

