The Impact of IR-based Classifier Configuration on the Performance and the Effort of Method-Level Bug Localization

Context: IR-based bug localization is a classifier that assists developers in locating buggy source code entities (e.g., files and methods) based on the content of a bug report. Such IR-based classifiers have various parameters that can be configured differently (e.g., the choice of entity representation). Objective: In this paper, we investigate the impact of the choice of the IR-based classifier configuration on the top-k performance and the required effort to examine source code entities before locating a bug at the method level. Method: We execute a large space of classifier configurations, 3,172 in total, on 5,266 bug reports of two software systems, i.e., Eclipse and Mozilla. Results: We find that (1) the choice of classifier configuration impacts the top-k performance from 0.44% [...] 50,000 LOC; (2) classifier configurations with similar top-k performance might require different efforts; (3) VSM achieves both the best top-k performance and the least required effort for method-level bug localization; (4) the likelihood of randomly picking a configuration that performs within 20% [...] classifier configuration is on average 5.4% [...] average 1% [...] analyzed data have the most impact on both the top-k performance and the required effort; and (6) the most efficient classifier configuration obtained at the method level can also be used at the file level (and vice versa). Conclusion: Our results lead us to conclude that configuration has a large impact on both the top-k performance and the required effort for method-level bug localization, suggesting that IR-based configuration settings should be carefully selected and that the required effort metric should be included in future bug localization studies.
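
To make the two evaluation measures named in the abstract concrete, the following is a minimal Python sketch under stated assumptions, not the authors' implementation. It assumes that top-k performance is the fraction of bug reports with at least one buggy method among the first k ranked methods, and that required effort is the cumulative LOC a developer reads down the ranked list up to and including the first buggy method. The function names, the data layout, and the toy data are hypothetical.

```python
from typing import Dict, List, Set


def top_k_performance(
    rankings: Dict[str, List[str]],
    buggy: Dict[str, Set[str]],
    k: int,
) -> float:
    """Fraction of bug reports with at least one buggy method in the top k.

    Assumed definition; `rankings` maps a bug report id to its ranked
    list of method ids, `buggy` maps it to the set of truly buggy methods.
    """
    hits = sum(
        1
        for bug_id, ranked in rankings.items()
        if any(method in buggy[bug_id] for method in ranked[:k])
    )
    return hits / len(rankings)


def required_effort_loc(
    ranked: List[str],
    buggy_methods: Set[str],
    loc: Dict[str, int],
) -> int:
    """Cumulative LOC examined up to and including the first buggy method.

    Assumed definition; if no buggy method appears in the list, the
    whole list has been examined and its total LOC is returned.
    """
    total = 0
    for method in ranked:
        total += loc[method]
        if method in buggy_methods:
            break
    return total


# Hypothetical toy data: two bug reports ranked over three methods.
rankings = {"b1": ["m1", "m2", "m3"], "b2": ["m3", "m1", "m2"]}
buggy = {"b1": {"m2"}, "b2": {"m2"}}
loc = {"m1": 10, "m2": 5, "m3": 20}

top2 = top_k_performance(rankings, buggy, k=2)
effort_b1 = required_effort_loc(rankings["b1"], buggy["b1"], loc)
effort_b2 = required_effort_loc(rankings["b2"], buggy["b2"], loc)
```

In this toy data both reports are eventually localized, but the efforts differ because the buggy method sits behind methods of different sizes, which is the kind of gap between top-k performance and required effort that result (2) describes.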

research
08/29/2021

BoostNSift: A Query Boosting and Code Sifting Technique for Method Level Bug Localization

Locating bugs is an important, but effort-intensive and time-consuming t...
research
04/22/2021

An Extensive Study on Smell-Aware Bug Localization

Bug localization is an important aspect of software maintenance because ...
research
08/08/2018

A Case Study on the Impact of Similarity Measure on Information Retrieval based Software Engineering Tasks

Information Retrieval (IR) plays a pivotal role in diverse Software Engi...
research
07/20/2018

Poster: Improving Bug Localization with Report Quality Dynamics and Query Reformulation

Recent findings from a user study suggest that IR-based bug localization...
research
02/07/2019

D&C: A Divide-and-Conquer Approach to IR-based Bug Localization

Many automated tasks in software maintenance rely on information retriev...
research
02/27/2018

Network-Clustered Multi-Modal Bug Localization

Developers often spend much effort and resources to debug a program. To ...
research
10/20/2020

Industry-scale IR-based Bug Localization: A Perspective from Facebook

We explore the application of Information Retrieval (IR) based bug local...
