Tag that issue: Applying API-domain labels in issue tracking systems

04/06/2023
by   Fabio Santos, et al.
0

Labeling issues with the skills required to complete them can help contributors to choose tasks in Open Source Software projects. However, manually labeling issues is time-consuming and error-prone, and current automated approaches are mostly limited to classifying issues as bugs/non-bugs. We investigate the feasibility and relevance of automatically labeling issues with what we call "API-domains," which are high-level categories of APIs. Therefore, we posit that the APIs used in the source code affected by an issue can be a proxy for the type of skills (e.g., DB, security, UI) needed to work on the issue. We ran a user study (n=74) to assess API-domain labels' relevancy to potential contributors, leveraged the issues' descriptions and the project history to build prediction models, and validated the predictions with contributors (n=20) of the projects. Our results show that (i) newcomers to the project consider API-domain labels useful in choosing tasks, (ii) labels can be predicted with a precision of 84 results of the predictions reached up to 71.3 when training with a project and testing in another (transfer learning), and (iv) project contributors consider most of the predictions helpful in identifying needed skills. These findings suggest our approach can be applied in practice to automatically label issues, assisting developers in finding tasks that better match their skills.

READ FULL TEXT
research
03/23/2021

Can I Solve It? Identifying APIs Required to Complete OSS Task

Open Source Software projects add labels to open issues to help contribu...
research
03/23/2023

GiveMeLabeledIssues: An Open Source Issue Recommendation System

Developers often struggle to navigate an Open Source Software (OSS) proj...
research
11/02/2022

Supporting the Task-driven Skill Identification in Open Source Project Issue Tracking Systems

Selecting an appropriate task is challenging for contributors to Open So...
research
08/10/2021

Issue Link Label Recovery and Prediction for Open Source Software

Modern open source software development heavily relies on the issue trac...
research
09/20/2022

Comparative analysis of real bugs in open-source Machine Learning projects – A Registered Report

Background: Machine Learning (ML) systems rely on data to make predictio...
research
11/11/2020

Leveraging the Defects Life Cycle to Label Affected Versions and Defective Classes

Two recent studies explicitly recommend labeling defective classes in re...
research
08/01/2019

Learning to Identify Security-Related Issues Using Convolutional Neural Networks

Software security is becoming a high priority for both large companies a...

Please sign up or login with your details

Forgot password? Click here to reset