On the Subjectivity of Emotions in Software Projects: How Reliable are Pre-Labeled Data Sets for Sentiment Analysis?

07/16/2022
by Marc Herrmann, et al.

Social aspects of software projects are becoming increasingly important for research and practice. Different approaches analyze the sentiment of a development team, ranging from simply asking the team to so-called sentiment analysis on text-based communication. These sentiment analysis tools are trained using pre-labeled data sets from different sources, including GitHub and Stack Overflow. In this paper, we investigate whether the labels of the statements in the data sets coincide with the perception of potential members of a software project team. Based on an international survey, we compare the median perception of 94 participants with the pre-labeled data sets, as well as every single participant's agreement with the predefined labels. Our results point to three remarkable findings: (1) although the median values coincide with the predefined labels of the data sets in 62.5% of the cases, there is a huge difference between the single participant's ratings and the labels; (2) there is not a single participant who totally agrees with the predefined labels; and (3) the data set whose labels are based on guidelines performs better than the ad hoc labeled data set.

