UniSent: Universal Adaptable Sentiment Lexica for 1000+ Languages

04/21/2019
by   Ehsaneddin Asgari, et al.
0

In this paper, we introduce UniSent a universal sentiment lexica for 1000 languages created using an English sentiment lexicon and a massively parallel corpus in the Bible domain. To the best of our knowledge, UniSent is the largest sentiment resource to date in terms of number of covered languages, including many low resource languages. To create UniSent, we propose Adapted Sentiment Pivot, a novel method that combines annotation projection, vocabulary expansion, and unsupervised domain adaptation. We evaluate the quality of UniSent for Macedonian, Czech, German, Spanish, and French and show that its quality is comparable to manually or semi-manually created sentiment resources. With the publication of this paper, we release UniSent lexica as well as Adapted Sentiment Pivot related codes. method.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/20/2022

NaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual Sentiment Analysis

Sentiment analysis is one of the most widely studied applications in NLP...
research
01/23/2018

SentiPers: A Sentiment Analysis Corpus for Persian

Sentiment Analysis (SA) is a major field of study in natural language pr...
research
04/24/2020

Development of a General Purpose Sentiment Lexicon for Igbo Language

There are publicly available general purpose sentiment lexicons in some ...
research
12/05/2022

Impact of Domain-Adapted Multilingual Neural Machine Translation in the Medical Domain

Multilingual Neural Machine Translation (MNMT) models leverage many lang...
research
04/03/2018

Emotions are Universal: Learning Sentiment Based Representations of Resource-Poor Languages using Siamese Networks

Machine learning approaches in sentiment analysis principally rely on th...
research
04/28/2017

Past, Present, Future: A Computational Investigation of the Typology of Tense in 1000 Languages

We present SuperPivot, an analysis method for low-resource languages tha...
research
04/29/2020

TUNIZI: a Tunisian Arabizi sentiment analysis Dataset

On social media, Arabic people tend to express themselves in their own l...

Please sign up or login with your details

Forgot password? Click here to reset