Data Preparation for Software Vulnerability Prediction: A Systematic Literature Review

09/13/2021
by   Roland Croft, et al.
0

Software Vulnerability Prediction (SVP) is a data-driven technique for software quality assurance that has recently gained considerable attention in the Software Engineering research community. However, the difficulties of preparing Software Vulnerability (SV) related data remains as the main barrier to industrial adoption. Despite this problem, there have been no systematic efforts to analyse the existing SV data preparation techniques and challenges. Without such insights, we are unable to overcome the challenges and advance this research domain. Hence, we are motivated to conduct a Systematic Literature Review (SLR) of SVP research to synthesize and gain an understanding of the data considerations, challenges and solutions that SVP researchers provide. From our set of primary studies, we identify the main practices for each data preparation step. We then present a taxonomy of 16 key data challenges relating to six themes, which we further map to six categories of solutions. However, solutions are far from complete, and there are several ill-considered issues. We also provide recommendations for future areas of SV data research. Our findings help illuminate the key SV data practices and considerations for SVP researchers and practitioners, as well as inform the validity of the current SVP approaches.

READ FULL TEXT

page 7

page 8

research
04/22/2020

Code Smells and Refactoring: A Tertiary Systematic Review of Challenges and Observations

In this paper, we present a tertiary systematic literature review of pre...
research
12/01/2018

Fuzzing: Art, Science, and Engineering

Among the many software vulnerability discovery techniques available tod...
research
01/13/2023

Data Quality for Software Vulnerability Datasets

The use of learning-based techniques to achieve automated software vulne...
research
03/23/2022

Runtime Software Patching: Taxonomy, Survey and Future Directions

Runtime software patching aims to minimize or eliminate service downtime...
research
07/18/2021

A Survey on Data-driven Software Vulnerability Assessment and Prioritization

Software Vulnerabilities (SVs) are increasing in complexity and scale, p...
research
04/06/2022

Advancing Data Justice Research and Practice: An Integrated Literature Review

The Advancing Data Justice Research and Practice (ADJRP) project aims to...

Please sign up or login with your details

Forgot password? Click here to reset