Debugging Crashes using Continuous Contrast Set Mining

11/12/2019
by   Rebecca Qian, et al.
0

Facebook operates a family of services used by over two billion people daily on a huge variety of mobile devices. Many devices are configured to upload crash reports should the app crash for any reason. Engineers monitor and triage millions of crash reports logged each day to check for bugs, regressions, and any other quality problems. Debugging groups of crashes is a manually intensive process that requires deep domain expertise and close inspection of traces and code, often under time constraints. We use contrast set mining, a form of discriminative pattern mining, to learn what distinguishes one group of crashes from another. Prior works focus on discretization to apply contrast mining to continuous data. We propose the first direct application of contrast learning to continuous data, without the need for discretization. We also define a weighted anomaly score that unifies continuous and categorical contrast sets while mitigating bias, as well as uncertainty measures that communicate confidence to developers. We demonstrate the value of our novel statistical improvements by applying it on a challenging dataset from Facebook production logs, where we achieve 40x speedup over baseline approaches using discretization.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/20/2020

Scalable Statistical Root Cause Analysis on App Telemetry

Despite engineering workflows that aim to prevent buggy code from being ...
research
02/16/2019

Getafix: Learning to fix bugs automatically

Static analyzers, including linters, can warn developers about programmi...
research
08/10/2017

Mobile Academy: A Ubiquitous Mobile Learning (mLearning) Platform

The paper reports on an ongoing research project into the development of...
research
07/08/2023

ReviewRanker: A Semi-Supervised Learning Based Approach for Code Review Quality Estimation

Code review is considered a key process in the software industry for min...
research
02/19/2021

Prioritize Crowdsourced Test Reports via Deep Screenshot Understanding

Crowdsourced testing is increasingly dominant in mobile application (app...
research
04/27/2020

An Empirical Study on Feature Discretization

When dealing with continuous numeric features, we usually adopt feature ...
research
01/07/2020

With Registered Reports Towards Large Scale Data Curation

The scale of manually validated data is currently limited by the effort ...

Please sign up or login with your details

Forgot password? Click here to reset