Automated Recovery of Issue-Commit Links Leveraging Both Textual and Non-textual Data

07/05/2021
by   Pooya Rostami Mazrae, et al.
0

An issue documents discussions around required changes in issue-tracking systems, while a commit contains the change itself in the version control systems. Recovering links between issues and commits can facilitate many software evolution tasks such as bug localization, and software documentation. A previous study on over half a million issues from GitHub reports only about 42.2 Automating the linking of commit-issue pairs can contribute to the improvement of the said tasks. By far, current state-of-the-art approaches for automated commit-issue linking suffer from low precision, leading to unreliable results, sometimes to the point that imposes human supervision on the predicted links. The low performance gets even more severe when there is a lack of textual information in either commits or issues. Current approaches are also proven computationally expensive. We propose Hybrid-Linker to overcome such limitations by exploiting two information channels; (1) a non-textual-based component that operates on non-textual, automatically recorded information of the commit-issue pairs to predict a link, and (2) a textual-based one which does the same using textual information of the commit-issue pairs. Then, combining the results from the two classifiers, Hybrid-Linker makes the final prediction. Thus, every time one component falls short in predicting a link, the other component fills the gap and improves the results. We evaluate Hybrid-Linker against competing approaches, namely FRLink and DeepLink on a dataset of 12 projects. Hybrid-Linker achieves 90.1 F-measure, respectively. It also outperforms FRLink and DeepLink by 31.3 41.3 improvements in terms of performance as well.

READ FULL TEXT
research
08/10/2021

Issue Link Label Recovery and Prediction for Open Source Software

Modern open source software development heavily relies on the issue trac...
research
01/20/2022

An Alternative Issue Tracking Dataset of Public Jira Repositories

Organisations use issue tracking systems (ITSs) to track and document th...
research
06/02/2020

Descriptions of issues and comments for predicting issue success in software projects

Software development tasks must be performed successfully to achieve sof...
research
06/14/2022

Automated Detection of Typed Links in Issue Trackers

Stakeholders in software projects use issue trackers like JIRA to captur...
research
05/18/2020

Improving the Effectiveness of Traceability Link Recovery using Hierarchical Bayesian Networks

Traceability is a fundamental component of the modern software developme...
research
04/27/2022

Beyond Duplicates: Towards Understanding and Predicting Link Types in Issue Tracking Systems

Software projects use Issue Tracking Systems (ITS) like JIRA to track is...
research
08/21/2023

EALink: An Efficient and Accurate Pre-trained Framework for Issue-Commit Link Recovery

Issue-commit links, as a type of software traceability links, play a vit...

Please sign up or login with your details

Forgot password? Click here to reset