A deep tree-based model for software defect prediction

02/03/2018
by Hoa Khanh Dam, et al.

Defects are common in software systems and can cause a range of problems for software users. Many methods have been developed to quickly predict the most likely locations of defects in large code bases. Most of them focus on designing features (e.g. complexity metrics) that correlate with potentially defective code. Those approaches, however, do not sufficiently capture the syntax and the different levels of semantics of source code, an important capability for building accurate prediction models. In this paper, we develop a novel prediction model that automatically learns features for representing source code and uses them for defect prediction. Our prediction system is built upon a deep learning architecture, the tree-structured Long Short-Term Memory network, whose structure directly matches the Abstract Syntax Tree representation of source code. An evaluation on two datasets, one from open source projects contributed by Samsung and the other from the public PROMISE repository, demonstrates the effectiveness of our approach for both within-project and cross-project predictions.
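
To make the idea of a tree-structured LSTM that follows the shape of an Abstract Syntax Tree more concrete, the following is a minimal sketch, not the authors' implementation. It rests on several assumptions that the abstract does not state: it uses the Child-Sum Tree-LSTM formulation, Python's standard `ast` module as a stand-in parser, AST node type names as the only input tokens, and random, untrained weights. The names `ChildSumTreeLSTM`, `defect_score`, `DIM`, and `w_out` are hypothetical.

```python
import ast
import math
import random

DIM = 4  # hypothetical hidden size, kept tiny for illustration


def matvec(m, v):
    """Multiply matrix m (list of rows) by vector v."""
    return [sum(w * x for w, x in zip(row, v)) for row in m]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


class ChildSumTreeLSTM:
    """Untrained Child-Sum Tree-LSTM cell applied recursively to an AST."""

    def __init__(self, dim=DIM, seed=0):
        self.dim = dim
        self.rng = random.Random(seed)
        # One input matrix W, one recurrent matrix U, and one bias b per gate:
        # i (input), f (forget), o (output), u (candidate update).
        self.W = {g: self._matrix() for g in "ifou"}
        self.U = {g: self._matrix() for g in "ifou"}
        self.b = {g: [0.0] * dim for g in "ifou"}
        self.embed = {}  # AST node type name -> embedding vector

    def _matrix(self):
        return [[self.rng.uniform(-0.5, 0.5) for _ in range(self.dim)]
                for _ in range(self.dim)]

    def embedding(self, name):
        # Assumption: each node type gets a random vector on first use.
        if name not in self.embed:
            self.embed[name] = [self.rng.uniform(-0.5, 0.5)
                                for _ in range(self.dim)]
        return self.embed[name]

    def gate(self, g, x, h, act):
        wx = matvec(self.W[g], x)
        uh = matvec(self.U[g], h)
        return [act(a + r + bias) for a, r, bias in zip(wx, uh, self.b[g])]

    def encode(self, node):
        """Return (h, c) for the subtree rooted at `node`, computed bottom-up."""
        x = self.embedding(type(node).__name__)
        children = [self.encode(ch) for ch in ast.iter_child_nodes(node)]
        # Sum of the children's hidden states (all zeros for a leaf).
        h_sum = [sum(h[j] for h, _ in children) for j in range(self.dim)]
        i = self.gate("i", x, h_sum, sigmoid)
        o = self.gate("o", x, h_sum, sigmoid)
        u = self.gate("u", x, h_sum, math.tanh)
        c = [ij * uj for ij, uj in zip(i, u)]
        # Each child gets its own forget gate, computed from that child's h.
        for h_k, c_k in children:
            f_k = self.gate("f", x, h_k, sigmoid)
            c = [cj + fj * ck for cj, fj, ck in zip(c, f_k, c_k)]
        h = [oj * math.tanh(cj) for oj, cj in zip(o, c)]
        return h, c


def defect_score(model, source, w_out):
    """Map the root hidden state to a value in (0, 1) with a logistic unit."""
    h, _ = model.encode(ast.parse(source))
    return sigmoid(sum(w * hj for w, hj in zip(w_out, h)))


model = ChildSumTreeLSTM()
w_out = [0.3, -0.2, 0.5, 0.1]  # hypothetical, untrained output weights
snippet = "def ratio(a, b):\n    return a / b\n"
print(defect_score(model, snippet, w_out))
```

Because the weights are random, the printed score carries no meaning; the sketch only shows how the hidden state of each AST node is assembled from the states of its children, so that the root vector summarizes the whole file. In a real system the weights and embeddings would be learned from labeled code.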

research
08/08/2017

Automatic feature learning for vulnerability prediction

Code flaws or vulnerabilities are prevalent in software systems and can ...
research
04/06/2023

A Unified Active Learning Framework for Annotating Graph Data with Application to Software Source Code Performance Prediction

Most machine learning and data analytics applications, including perform...
research
06/19/2019

Automatic Source Code Summarization with Extended Tree-LSTM

Neural machine translation models are used to automatically generate a d...
research
07/30/2016

DeepSoft: A vision for a deep model of software

Although software analytics has experienced rapid growth as a research a...
research
03/26/2018

code2vec: Learning Distributed Representations of Code

We present a neural model for representing snippets of code as continuou...
research
02/12/2022

Revisiting the Impact of Dependency Network Metrics on Software Defect Prediction

Software dependency network metrics extracted from the dependency graph ...
research
09/02/2016

A deep learning model for estimating story points

Although there has been substantial research in software analytics for e...
