Log In Sign Up

L3CubeMahaSent: A Marathi Tweet-based Sentiment Analysis Dataset

by   Atharva Kulkarni, et al.

Sentiment analysis is one of the most fundamental tasks in Natural Language Processing. Popular languages like English, Arabic, Russian, Mandarin, and also Indian languages such as Hindi, Bengali, Tamil have seen a significant amount of work in this area. However, the Marathi language which is the third most popular language in India still lags behind due to the absence of proper datasets. In this paper, we present the first major publicly available Marathi Sentiment Analysis Dataset - L3CubeMahaSent. It is curated using tweets extracted from various Maharashtrian personalities' Twitter accounts. Our dataset consists of  16,000 distinct tweets classified in three broad classes viz. positive, negative, and neutral. We also present the guidelines using which we annotated the tweets. Finally, we present the statistics of our dataset and baseline classification results using CNN, LSTM, ULMFiT, and BERT-based deep learning models.


Sentiment Analysis at SEPLN (TASS)-2019: Sentiment Analysis at Tweet level using Deep Learning

This paper describes the system submitted to "Sentiment Analysis at SEPL...

Bambara Language Dataset for Sentiment Analysis

For easier communication, posting, or commenting on each others posts, p...

Sentiment Analysis for Sinhala Language using Deep Learning Techniques

Due to the high impact of the fast-evolving fields of machine learning a...

Comparing methods for Twitter Sentiment Analysis

This work extends the set of works which deal with the popular problem o...

Building a Sentiment Corpus of Tweets in Brazilian Portuguese

The large amount of data available in social media, forums and websites ...

Sentiment analysis in tweets: an assessment study from classical to modern text representation models

With the growth of social medias, such as Twitter, plenty of user-genera...

HashSet – A Dataset For Hashtag Segmentation

Hashtag segmentation is the task of breaking a hashtag into its constitu...