WELL: Applying Bug Detectors to Bug Localization via Weakly Supervised Learning

05/27/2023
by   Zhuo Li, et al.
0

Bug localization, which is used to help programmers identify the location of bugs in source code, is an essential task in software development. Researchers have already made efforts to harness the powerful deep learning (DL) techniques to automate it. However, training bug localization model is usually challenging because it requires a large quantity of data labeled with the bug's exact location, which is difficult and time-consuming to collect. By contrast, obtaining bug detection data with binary labels of whether there is a bug in the source code is much simpler. This paper proposes a WEakly supervised bug LocaLization (WELL) method, which only uses the bug detection data with binary labels to train a bug localization model. With CodeBERT finetuned on the buggy-or-not binary labeled data, WELL can address bug localization in a weakly supervised manner. The evaluations on three method-level synthetic datasets and one file-level real-world dataset show that WELL is significantly better than the existing SOTA model in typical bug localization tasks such as variable misuse and other programming bugs.

READ FULL TEXT
research
08/29/2021

BoostNSift: A Query Boosting and Code Sifting Technique for Method Level Bug Localization

Locating bugs is an important, but effort-intensive and time-consuming t...
research
04/30/2018

DeepBugs: A Learning Approach to Name-based Bug Detection

Natural language elements in source code, e.g., the names of variables a...
research
03/27/2023

Machine Learning for Microprocessor Performance Bug Localization

The validation process for microprocessors is a very complex task that c...
research
11/28/2019

Using Distributed Representation of Code for Bug Detection

Recent advances in neural modeling for bug detection have been very prom...
research
02/28/2023

Large-Scale Evaluation of Method-Level Bug Localization with FinerBench4BL

Bug localization is an important aspect of software maintenance because ...
research
03/03/2021

A Fault Localization and Debugging Support Framework driven by Bug Tracking Data

Fault localization has been determined as a major resource factor in the...
research
04/21/2022

On Distribution Shift in Learning-based Bug Detectors

Deep learning has recently achieved initial success in program analysis ...

Please sign up or login with your details

Forgot password? Click here to reset