DapStep: Deep Assignee Prediction for Stack Trace Error rePresentation

01/14/2022
by   Denis Sushentsev, et al.
0

The task of finding the best developer to fix a bug is called bug triage. Most of the existing approaches consider the bug triage task as a classification problem, however, classification is not appropriate when the sets of classes change over time (as developers often do in a project). Furthermore, to the best of our knowledge, all the existing models use textual sources of information, i.e., bug descriptions, which are not always available. In this work, we explore the applicability of existing solutions for the bug triage problem when stack traces are used as the main data source of bug reports. Additionally, we reformulate this task as a ranking problem and propose new deep learning models to solve it. The models are based on a bidirectional recurrent neural network with attention and on a convolutional neural network, with the weights of the models optimized using a ranking loss function. To improve the quality of ranking, we propose using additional information from version control system annotations. Two approaches are proposed for extracting features from annotations: manual and using an additional neural network. To evaluate our models, we collected two datasets of real-world stack traces. Our experiments show that the proposed models outperform existing models adapted to handle stack traces. To facilitate further research in this area, we publish the source code of our models and one of the collected datasets.

READ FULL TEXT
research
01/04/2018

DeepTriage: Exploring the Effectiveness of Deep Learning for Bug Triaging

For a given software bug report, identifying an appropriate developer wh...
research
04/17/2020

Can We Use Stack Overflow as a Source of Explainable Bug-fix Data?

Bug-fix data sets are important for building various software engineerin...
research
07/08/2018

Automated labeling of bugs and tickets using attention-based mechanisms in recurrent neural networks

We explore solutions for automated labeling of content in bug trackers a...
research
04/30/2022

Aggregation of Stack Trace Similarities for Crash Report Deduplication

The automatic collection of stack traces in bug tracking systems is an i...
research
02/28/2018

Orion+: Automated Problem Diagnosis in Computing Systems by Mining Metric Data

This work presents the suspicious code at a finer granularity of call st...
research
10/31/2018

Making root cause analysis feasible for large code bases: a solution approach for a climate model

Applications that simulate complex physical processes can be composed of...
research
03/18/2021

S3M: Siamese Stack (Trace) Similarity Measure

Automatic crash reporting systems have become a de-facto standard in sof...

Please sign up or login with your details

Forgot password? Click here to reset