BuGL – A Cross-Language Dataset for Bug Localization

04/19/2020
by Sandeep Muvva, et al.

Bug localization is the process of identifying potentially error-prone files or methods from a given bug report and source code. There is extensive research on bug localization that applies information retrieval techniques, machine learning or deep learning approaches, or both, to detect the location of bugs. The common premise of all these approaches is the availability of a good dataset, which in this case is the standard benchmark dataset comprising 6 Java projects or, in some cases, more than 6 Java projects. The existing datasets do not include projects written in other programming languages, despite the need to investigate specific and cross-project bug localization. To the best of our knowledge, no existing dataset addresses this concern. In this paper, we present BuGL, a large-scale cross-language dataset. BuGL consists of more than 10,000 bug reports drawn from open-source projects written in four programming languages, namely C, C++, Java, and Python. The dataset includes information from bug reports and pull requests. BuGL aims to open up new research opportunities in the area of bug localization.

research
11/06/2020

DRAST – A Deep Learning and AST Based Approach for Bug Localization

Context: Given a bug report and source code of the project, bug localiza...
research
03/21/2018

Estimating defectiveness of source code: A predictive model using GitHub content

Two key contributions presented in this paper are: i) A method for build...
research
05/09/2023

RLocator: Reinforcement Learning for Bug Localization

Software developers spend a significant portion of time fixing bugs in t...
research
09/08/2021

BLESER: Bug Localization Based on Enhanced Semantic Retrieval

Static bug localization techniques that locate bugs at method granularit...
research
06/14/2021

IncBL: Incremental Bug Localization

Numerous efforts have been invested in improving the effectiveness of bu...
research
01/03/2018

How Does Bug-Handling Effort Differ Among Different Programming Languages?

Handling bugs is an essential part of software development. The impact o...
research
07/19/2019

On Usefulness of the Deep-Learning-Based Bug Localization Models to Practitioners

Background: Developers spend a significant amount of time and efforts to...
