Using Consensual Biterms from Text Structures of Requirements and Code to Improve IR-Based Traceability Recovery

09/05/2022
by   Hui Gao, et al.
0

Traceability approves trace links among software artifacts based on whether two artifacts are related by system functionalities. The traces are valuable for software development, but are difficult to obtain manually. To cope with the costly and fallible manual recovery, automated approaches are proposed to recover traces through textual similarities among software artifacts, such as those based on Information Retrieval (IR). However, the low quality quantity of artifact texts negatively impact the calculated IR values, thus greatly hindering the performance of IR-based approaches. In this study, we propose to extract co-occurred word pairs from the text structures of both requirements and code (i.e., consensual biterms) to improve IR-based traceability recovery. We first collect a set of biterms based on the part-of-speech of requirement texts, and then filter them through the code texts. We then use these consensual biterms to both enrich the input corpus for IR techniques and enhance the calculations of IR values. A nine-system-based evaluation shows that in general, when solely used to enhance IR techniques, our approach can outperform pure IR-based approaches and another baseline by 21.9 AP, and 9.3 with another enhancing strategy from different perspectives, it can outperform this baseline by 5.9

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/03/2022

Enhancing Automated Software Traceability by Transfer Learning from Open-World Data

Software requirements traceability is a critical component of the softwa...
research
07/17/2018

Automatic Traceability Maintenance via Machine Learning Classification

Previous studies have shown that software traceability, the ability to l...
research
07/27/2021

Yet Another Combination of IR- and Neural-based Comment Generation

Code comment generation techniques aim to generate natural language desc...
research
05/18/2020

Improving the Effectiveness of Traceability Link Recovery using Hierarchical Bayesian Networks

Traceability is a fundamental component of the modern software developme...
research
06/13/2019

An IR-based Approach Towards Automated Integration of Geo-spatial Datasets in Map-based Software Systems

Data is arguably the most valuable asset of the modern world. In this er...
research
04/18/2023

BIM-GPT: a Prompt-Based Virtual Assistant Framework for BIM Information Retrieval

Efficient information retrieval (IR) from building information models (B...
research
09/07/2018

Challenges for Measuring Usefulness of Interactive IR Systems with Log-based Approaches

The usefulness evaluation model proposed by Cole et al. in 2009 [2] focu...

Please sign up or login with your details

Forgot password? Click here to reset