Sentiment Analysis of Code-Mixed Social Media Text (Hinglish)

02/24/2021
by Gaurav Singh, et al.

This paper reports results for several techniques applied to sentiment analysis of code-mixed social media (Twitter) text written in Hinglish. The work proceeded in four stages: data consolidation, data cleaning, data transformation, and modelling. Several data cleaning techniques were applied over five iterations, and experimental results were recorded after each iteration. The data was transformed using a count vectorizer, a one-hot vectorizer, a tf-idf vectorizer, and doc2vec, word2vec, and fastText embeddings. Models were built with machine learning algorithms including SVM, KNN, Decision Trees, Random Forests, Naive Bayes, Logistic Regression, and ensemble voting classifiers. The data came from SemEval-2020 Task 9, which was hosted on the CodaLab competition website. Models were evaluated using the macro-averaged F1-score. The best F1-score, 69.07, was achieved with an ensemble voting classifier.
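
To make the evaluation setup concrete, here is a minimal sketch of how an ensemble voting classifier and the macro-averaged F1-score fit together. It is not the authors' implementation. The abstract does not say whether voting was hard or soft, so hard (majority) voting is assumed here, and the three label names, the toy predictions, and the helper names `majority_vote` and `macro_f1` are all hypothetical. Note that this sketch returns F1 on a 0 to 1 scale, whereas the abstract reports it on a 0 to 100 scale.

```python
from collections import Counter


def majority_vote(predictions_per_model):
    """Hard voting: for each sample, return the label most models predicted.

    Ties are broken in favour of the earliest model in the list
    (an assumption made for this sketch).
    """
    voted = []
    for sample_preds in zip(*predictions_per_model):
        counts = Counter(sample_preds)
        top = max(counts.values())
        # First prediction, in model order, that reaches the top count.
        voted.append(next(p for p in sample_preds if counts[p] == top))
    return voted


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 scores, on a 0-1 scale."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never occurs.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy gold labels and per-model predictions (illustrative only, not from the paper).
y_true = ["positive", "negative", "neutral", "positive", "negative", "neutral"]
model_a = ["positive", "negative", "neutral", "neutral", "negative", "positive"]
model_b = ["positive", "neutral", "neutral", "positive", "negative", "neutral"]
model_c = ["negative", "negative", "positive", "positive", "negative", "neutral"]

ensemble = majority_vote([model_a, model_b, model_c])

print("model_a macro F1:", round(macro_f1(y_true, model_a), 4))
print("ensemble macro F1:", round(macro_f1(y_true, ensemble), 4))
```

Macro averaging gives every class the same weight regardless of how often it occurs, so weak performance on a rare class lowers the score as much as weak performance on a common one.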

research
08/09/2018

Code-Mixed Sentiment Analysis Using Machine Learning and Neural Network Approaches

Sentiment Analysis for Indian Languages (SAIL)-Code Mixed tools contest ...
research
09/21/2020

WESSA at SemEval-2020 Task 9: Code-Mixed Sentiment Analysis using Transformers

In this paper, we describe our system submitted for SemEval 2020 Task 9,...
research
08/26/2020

Decision Tree J48 at SemEval-2020 Task 9: Sentiment Analysis for Code-Mixed Social Media Text (Hinglish)

This paper discusses the design of the system used for providing a solut...
research
07/03/2015

Twitter Sentiment Analysis: Lexicon Method, Machine Learning Method and Their Combination

This paper covers the two approaches for sentiment analysis: i) lexicon ...
research
12/22/2021

Multimodal Analysis of memes for sentiment extraction

Memes are one of the most ubiquitous forms of social media communication...
research
06/14/2020

Application of Data Science to Discover Violence-Related Issues in Iraq

Data science has been satisfactorily used to discover social issues in s...
research
06/13/2021

SASICM A Multi-Task Benchmark For Subtext Recognition

Subtext is a kind of deep semantics which can be acquired after one or m...
