Industry-scale IR-based Bug Localization: A Perspective from Facebook

10/20/2020
by   Vijayaraghavan Murali, et al.
0

We explore the application of Information Retrieval (IR) based bug localization methods at a large industrial setting, Facebook. Facebook's code base evolves rapidly, with thousands of code changes being committed to a monolithic repository every day. When a bug is detected, it is often time-sensitive and imperative to identify the commit causing the bug in order to either revert it or fix it. This is complicated by the fact that bugs often manifest with complex and unwieldy features, such as stack traces and other metadata. Code commits also have various features associated with them, ranging from developer comments to test results. This poses unique challenges to bug localization methods, making it a highly non-trivial operation. In this paper we lay out several practical concerns for industry-level IR-based bug localization, and propose Bug2Commit, a tool that is designed to address these concerns. We also assess the effectiveness of existing IR-based localization techniques from the software engineering community, and find that in the presence of complex queries or documents, which are common at Facebook, existing approaches do not perform as well as Bug2Commit. We evaluate Bug2Commit on three applications at Facebook: client-side crashes from the mobile app, server-side performance regressions, and mobile simulation tests for performance. We find that Bug2Commit outperforms the accuracy of existing approaches by up to 17 savings in compute usage for simulations.

READ FULL TEXT
research
08/29/2021

BoostNSift: A Query Boosting and Code Sifting Technique for Method Level Bug Localization

Locating bugs is an important, but effort-intensive and time-consuming t...
research
04/22/2021

An Extensive Study on Smell-Aware Bug Localization

Bug localization is an important aspect of software maintenance because ...
research
07/20/2018

Poster: Improving Bug Localization with Report Quality Dynamics and Query Reformulation

Recent findings from a user study suggest that IR-based bug localization...
research
08/11/2021

The Forgotten Role of Search Queries in IR-based Bug Localization: An Empirical Study

Being light-weight and cost-effective, IR-based approaches for bug local...
research
06/20/2018

The Impact of IR-based Classifier Configuration on the Performance and the Effort of Method-Level Bug Localization

Context: IR-based bug localization is a classifier that assists develope...
research
10/31/2018

Making root cause analysis feasible for large code bases: a solution approach for a climate model

Applications that simulate complex physical processes can be composed of...
research
11/09/2020

First Infrastructure and Experimentation in Echo-debugging

As applications get developed, bugs inevitably get introduced. Often, it...

Please sign up or login with your details

Forgot password? Click here to reset