Generating High-Quality Emotion Arcs For Low-Resource Languages Using Emotion Lexicons

06/03/2023
by   Daniela Teodorescu, et al.
0

Automatically generated emotion arcs – that capture how an individual or a population feels over time – are widely used in industry and research. However, there is little work on evaluating the generated arcs in English (where the emotion resources are available) and no work on generating or evaluating emotion arcs for low-resource languages. Work on generating emotion arcs in low-resource languages such as those indigenous to Africa, the Americas, and Australia is stymied by the lack of emotion-labeled resources and large language models for those languages. Work on evaluating emotion arcs (for any language) is scarce because of the difficulty of establishing the true (gold) emotion arc. Our work, for the first time, systematically and quantitatively evaluates automatically generated emotion arcs. We also compare two common ways of generating emotion arcs: Machine-Learning (ML) models and Lexicon-Only (LexO) methods. By running experiments on 42 diverse datasets in 9 languages, we show that despite being markedly poor at instance level emotion classification, LexO methods are highly accurate at generating emotion arcs when aggregating information from hundreds of instances. (Predicted arcs have correlations ranging from 0.94 to 0.99 with the gold arcs for various emotions.) We also show that for languages with no emotion lexicons, automatic translations of English emotion lexicons can be used to generate high-quality emotion arcs – correlations above 0.9 with the gold emotion arcs in all six indigenous African languages explored. This opens up avenues for work on emotions in numerous languages from around the world; crucial not only for commerce, public policy, and health research in service of speakers of those languages, but also to draw meaningful conclusions in emotion-pertinent research using information from around the world (thereby avoiding a western-centric bias in research).

READ FULL TEXT

page 9

page 10

page 12

page 13

page 15

page 16

page 20

page 32

research
10/13/2022

Frustratingly Easy Sentiment Analysis of Text Streams: Generating High-Quality Emotion Arcs Using Emotion Lexicons

Automatically generated emotion arcs – that capture how an individual or...
research
10/12/2022

Transformer-based Text Classification on Unified Bangla Multi-class Emotion Corpus

Because of its importance in studying people's thoughts on various Web 2...
research
05/12/2020

Learning and Evaluating Emotion Lexicons for 91 Languages

Emotion lexicons describe the affective meaning of words and thus consti...
research
07/02/2018

Representation Mapping: A Novel Approach to Generate High-Quality Multi-Lingual Emotion Lexicons

In the past years, sentiment analysis has increasingly shifted attention...
research
06/09/2021

MICE: A Crosslinguistic Emotion Corpus in Malay, Indonesian, Chinese and English

MICE is a corpus of emotion words in four languages which is currently w...
research
05/01/2019

A system for the 2019 Sentiment, Emotion and Cognitive State Task of DARPAs LORELEI project

During the course of a Humanitarian Assistance-Disaster Relief (HADR) cr...
research
04/17/2021

Emotion Classification in a Resource Constrained Language Using Transformer-based Approach

Although research on emotion classification has significantly progressed...

Please sign up or login with your details

Forgot password? Click here to reset