Multilingual Sentiment Analysis: An RNN-Based Framework for Limited Data

06/08/2018
by   Ethem F. Can, et al.
0

Sentiment analysis is a widely studied NLP task where the goal is to determine opinions, emotions, and evaluations of users towards a product, an entity or a service that they are reviewing. One of the biggest challenges for sentiment analysis is that it is highly language dependent. Word embeddings, sentiment lexicons, and even annotated data are language specific. Further, optimizing models for each language is very time consuming and labor intensive especially for recurrent neural network models. From a resource perspective, it is very challenging to collect data for different languages. In this paper, we look for an answer to the following research question: can a sentiment analysis model trained on a language be reused for sentiment analysis in other languages, Russian, Spanish, Turkish, and Dutch, where the data is more limited? Our goal is to build a single model in the language with the largest dataset available for the task, and reuse it for languages that have limited resources. For this purpose, we train a sentiment analysis model using recurrent neural networks with reviews in English. We then translate reviews in other languages and reuse this model to evaluate the sentiments. Experimental results show that our robust approach of single model trained on English reviews statistically significantly outperforms the baselines in several different languages.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/26/2019

Pars-ABSA: An Aspect-based Sentiment Analysis Dataset in Persian

Due to the increased availability of online reviews, sentiment analysis ...
research
09/07/2016

Sentiment Classification of Food Reviews

Sentiment analysis of reviews is a popular task in natural language proc...
research
01/01/2016

Sentiment/Subjectivity Analysis Survey for Languages other than English

Subjective and sentiment analysis have gained considerable attention rec...
research
02/06/2017

Q-WordNet PPV: Simple, Robust and (almost) Unsupervised Generation of Polarity Lexicons for Multiple Languages

This paper presents a simple, robust and (almost) unsupervised dictionar...
research
04/03/2018

Emotions are Universal: Learning Sentiment Based Representations of Resource-Poor Languages using Siamese Networks

Machine learning approaches in sentiment analysis principally rely on th...
research
07/18/2020

A novel approach to sentiment analysis in Persian using discourse and external semantic information

Sentiment analysis attempts to identify, extract and quantify affective ...
research
12/16/2021

Explainable Natural Language Processing with Matrix Product States

Despite empirical successes of recurrent neural networks (RNNs) in natur...

Please sign up or login with your details

Forgot password? Click here to reset