Data Augmentation for Improving Emotion Recognition in Software Engineering Communication

08/10/2022
by Mia Mohammad Imran, et al.

Emotions (e.g., Joy, Anger) are prevalent in daily software engineering (SE) activities and are known to be significant indicators of work productivity (e.g., bug-fixing efficiency). Recent studies have shown that directly applying general-purpose emotion classification tools to SE corpora is not effective. Even within the SE domain, tool performance degrades significantly when a tool is trained on one communication channel and evaluated on another (e.g., StackOverflow vs. GitHub comments). Retraining a tool with channel-specific data takes significant effort, since manually annotating large ground-truth datasets is expensive. In this paper, we address this data scarcity problem by automatically creating new training data using a data augmentation technique. Based on an analysis of the types of errors made by popular SE-specific emotion recognition tools, we target our data augmentation strategy specifically at improving emotion recognition performance. Our results show an average improvement of 9.3% in micro F1-score for three existing emotion classification tools (ESEM-E, EMTk, SEntiMoji) when trained with our best augmentation strategy.

