Traceability in the Wild: Automatically Augmenting Incomplete Trace Links

04/06/2018
by   Michael Rath, et al.
0

Software and systems traceability is widely accepted as an essential element for supporting many software development tasks. Today's version control systems provide inbuilt features that allow developers to tag each commit with one or more issue ID, thereby providing the building blocks from which project-wide traceability can be established between feature requests, bug fixes, commits, source code, and specific developers. However, our analysis of six open source projects showed that on average only 60 issues. Without these fundamental links the entire set of project-wide links will be incomplete, and therefore not trustworthy. In this paper we address the fundamental problem of missing links between commits and issues. Our approach leverages a combination of process and text-related features characterizing issues and code changes to train a classifier to identify missing issue tags in commit messages, thereby generating the missing links. We conducted a series of experiments to evaluate our approach against six open source projects and showed that it was able to effectively recommend links for tagging issues at an average of 96 of existing trace links, the classifier returned precision at levels greater than 89

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/07/2018

A Simple NLP-based Approach to Support Onboarding and Retention in Open-Source Communities

Successful open source communities are constantly looking for members an...
research
06/20/2022

PR-SZZ: How pull requests can support the tracing of defects in software repositories

The SZZ algorithm represents a standard way to identify bug fixing commi...
research
08/10/2021

Issue Link Label Recovery and Prediction for Open Source Software

Modern open source software development heavily relies on the issue trac...
research
02/08/2021

Traceability Transformed: Generating more Accurate Links with Pre-Trained BERT Models

Software traceability establishes and leverages associations between div...
research
07/15/2023

Improving Trace Link Recommendation by Using Non-Isotropic Distances and Combinations

The existence of trace links between artifacts of the software developme...
research
01/22/2019

9.6 Million Links in Source Code Comments: Purpose, Evolution, and Decay

Links are an essential feature of the World Wide Web, and source code re...
research
04/07/2022

Backports: Change Types, Challenges and Strategies

Source code repositories allow developers to manage multiple versions (o...

Please sign up or login with your details

Forgot password? Click here to reset