Building a Sentiment Corpus of Tweets in Brazilian Portuguese

12/24/2017
by   Henrico Bertini Brum, et al.
0

The large amount of data available in social media, forums and websites motivates researches in several areas of Natural Language Processing, such as sentiment analysis. The popularity of the area due to its subjective and semantic characteristics motivates research on novel methods and approaches for classification. Hence, there is a high demand for datasets on different domains and different languages. This paper introduces TweetSentBR, a sentiment corpora for Brazilian Portuguese manually annotated with 15.000 sentences on TV show domain. The sentences were labeled in three classes (positive, neutral and negative) by seven annotators, following literature guidelines for ensuring reliability on the annotation. We also ran baseline experiments on polarity classification using three machine learning methods, reaching 80.99 F-Measure and 82.06 and 64.62

READ FULL TEXT
research
03/21/2021

L3CubeMahaSent: A Marathi Tweet-based Sentiment Analysis Dataset

Sentiment analysis is one of the most fundamental tasks in Natural Langu...
research
10/25/2020

Transgender Community Sentiment Analysis from Social Media Data: A Natural Language Processing Approach

Transgender community is experiencing a huge disparity in mental health ...
research
07/09/2017

PELESent: Cross-domain polarity classification using distant supervision

The enormous amount of texts published daily by Internet users has foste...
research
04/03/2018

Sentiment Analysis of Code-Mixed Languages leveraging Resource Rich Languages

Code-mixed data is an important challenge of natural language processing...
research
08/25/2020

The Impact of Indirect Machine Translation on Sentiment Classification

Sentiment classification has been crucial for many natural language proc...
research
01/23/2018

SentiPers: A Sentiment Analysis Corpus for Persian

Sentiment Analysis (SA) is a major field of study in natural language pr...
research
08/10/2022

The Moral Foundations Reddit Corpus

Moral framing and sentiment can affect a variety of online and offline b...

Please sign up or login with your details

Forgot password? Click here to reset