Investigation of Dataset Features for Just-in-Time Defect Prediction

09/25/2021
by   Giuseppe Ng, et al.
0

Just-in-time (JIT) defect prediction refers to the technique of predicting whether a code change is defective. Many contributions have been made in this area through the excellent dataset by Kamei. In this paper, we revisit the dataset and highlight preprocessing difficulties with the dataset and the limitations of the dataset on unsupervised learning. Secondly, we propose certain features in the Kamei dataset that can be used for training models. Lastly, we discuss the limitations of the dataset's features.

READ FULL TEXT

page 1

page 2

page 3

page 4

10/02/2021

The Need for a Fine-grained approach in Just-in-Time Defect Prediction

With software system complexity leading to the rise of software defects,...
03/12/2021

JITLine: A Simpler, Better, Faster, Finer-grained Just-In-Time Defect Prediction

A Just-In-Time (JIT) defect prediction model is a classifier to predict ...
02/28/2022

ApacheJIT: A Large Dataset for Just-In-Time Defect Prediction

In this paper, we present ApacheJIT, a large dataset for Just-In-Time de...
03/02/2022

Supervised Hebbian learning: toward eXplainable AI

In neural network's Literature, Hebbian learning traditionally refers to...
07/26/2019

Exploiting new forms of data to study the private rented sector: strengths and limitations of a database of rental listings

Reviews of official statistics for UK housing have noted that developmen...
10/07/2020

Improving the efficiency of spectral features extraction by structuring the audio files

The extraction of spectral features from a music clip is a computational...
08/08/2022

Learning to Learn to Predict Performance Regressions in Production at Meta

Catching and attributing code change-induced performance regressions in ...