Investigation of Dataset Features for Just-in-Time Defect Prediction

09/25/2021
by Giuseppe Ng, et al.

Just-in-time (JIT) defect prediction is the technique of predicting whether a code change is defective. Many contributions in this area build on the dataset by Kamei. In this paper, we revisit that dataset. First, we highlight the difficulties of preprocessing it and its limitations for unsupervised learning. Second, we propose certain features in the Kamei dataset that can be used for training models. Lastly, we discuss the limitations of the dataset's features.
