ArmanEmo: A Persian Dataset for Text-based Emotion Detection

07/24/2022
by Hossein Mirzaee, et al.

With the recent proliferation of open textual data on social media platforms, Emotion Detection (ED) from text has received growing attention in recent years. It has many applications, especially for businesses and online service providers, where emotion detection techniques can support informed commercial decisions by analyzing customers' and users' feelings towards their products and services. In this study, we introduce ArmanEmo, a human-labeled emotion dataset of more than 7,000 Persian sentences labeled for seven categories. The dataset was collected from several sources, including Twitter, Instagram, and Digikala (an Iranian e-commerce company) comments. Labels are based on Ekman's six basic emotions (Anger, Fear, Happiness, Hatred, Sadness, Wonder) plus an additional category (Other) for any emotion not covered by Ekman's model. Along with the dataset, we provide several baseline models for emotion classification, focusing on state-of-the-art transformer-based language models. Our best model achieves a macro-averaged F1 score of 75.39 percent on our test set. We also conduct transfer learning experiments to compare the generalization of our proposed dataset against that of other Persian emotion datasets. The results of these experiments suggest that our dataset generalizes better than the existing Persian emotion datasets. ArmanEmo is publicly available for non-commercial use at https://github.com/Arman-Rayan-Sharif/arman-text-emotion.
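For readers unfamiliar with the reported metric, the following is a minimal sketch of how a macro-averaged F1 score over the seven categories can be computed. It is not the authors' evaluation code: the function name `macro_f1`, the `LABELS` list, and the choice to score a class with no true or predicted examples as 0 are all assumptions made for illustration.

```python
from collections import Counter

# Label names follow the seven categories listed in the abstract.
LABELS = ["Anger", "Fear", "Happiness", "Hatred", "Sadness", "Wonder", "Other"]


def macro_f1(y_true, y_pred, labels=LABELS):
    """Return the unweighted mean of the per-class F1 scores."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted class gains a false positive
            fn[truth] += 1  # true class gains a false negative
    scores = []
    for label in labels:
        denom = 2 * tp[label] + fp[label] + fn[label]
        # Assumption: a class that never appears contributes an F1 of 0.
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(labels)
```

Because every class carries equal weight in the average, a model that performs poorly on a rare category is penalized as much as one that performs poorly on a frequent category, which is why macro averaging is a common choice for imbalanced label sets.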

