Log In Sign Up

SemEval-2020 Task 9: Overview of Sentiment Analysis of Code-Mixed Tweets

by   Parth Patwa, et al.

In this paper, we present the results of the SemEval-2020 Task 9 on Sentiment Analysis of Code-Mixed Tweets (SentiMix 2020). We also release and describe our Hinglish (Hindi-English) and Spanglish (Spanish-English) corpora annotated with word-level language identification and sentence-level sentiment labels. These corpora are comprised of 20K and 19K examples, respectively. The sentiment labels are - Positive, Negative, and Neutral. SentiMix attracted 89 submissions in total including 61 teams that participated in the Hinglish contest and 28 submitted systems to the Spanglish competition. The best performance achieved was 75.0 BERT-like models and ensemble methods are the most common and successful approaches among the participants.


page 1

page 2

page 3

page 4


Reed at SemEval-2020 Task 9: Fine-Tuning and Bag-of-Words Approaches to Code-Mixed Sentiment Analysis

We explore the task of sentiment analysis on Hinglish (code-mixed Hindi-...

Overview of the Arabic Sentiment Analysis 2021 Competition at KAUST

This paper provides an overview of the Arabic Sentiment Analysis Challen...

Stance and Sentiment in Tweets

We can often detect from a person's utterances whether he/she is in favo...

DynaSent: A Dynamic Benchmark for Sentiment Analysis

We introduce DynaSent ('Dynamic Sentiment'), a new English-language benc...

A New Statistical Approach for Comparing Algorithms for Lexicon Based Sentiment Analysis

Lexicon based sentiment analysis usually relies on the identification of...

Sentiment analysis model for Twitter data in Polish language

Text mining analysis of tweets gathered during Polish presidential elect...