S3M: Siamese Stack (Trace) Similarity Measure

03/18/2021
by   Aleksandr Khvorov, et al.
0

Automatic crash reporting systems have become a de-facto standard in software development. These systems monitor target software, and if a crash occurs they send details to a backend application. Later on, these reports are aggregated and used in the development process to 1) understand whether it is a new or an existing issue, 2) assign these bugs to appropriate developers, and 3) gain a general overview of the application's bug landscape. The efficiency of report aggregation and subsequent operations heavily depends on the quality of the report similarity metric. However, a distinctive feature of this kind of report is that no textual input from the user (i.e., bug description) is available: it contains only stack trace information. In this paper, we present S3M ("extreme") – the first approach to computing stack trace similarity based on deep learning. It is based on a siamese architecture that uses a biLSTM encoder and a fully-connected classifier to compute similarity. Our experiments demonstrate the superiority of our approach over the state-of-the-art on both open-sourced data and a private JetBrains dataset. Additionally, we review the impact of stack trace trimming on the quality of the results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/26/2020

TraceSim: A Method for Calculating Stack Trace Similarity

Many contemporary software products have subsystems for automatic crash ...
research
04/30/2022

Aggregation of Stack Trace Similarities for Crash Report Deduplication

The automatic collection of stack traces in bug tracking systems is an i...
research
01/04/2018

DeepTriage: Exploring the Effectiveness of Deep Learning for Bug Triaging

For a given software bug report, identifying an appropriate developer wh...
research
08/17/2023

A Comparative Study of Text Embedding Models for Semantic Text Similarity in Bug Reports

Bug reports are an essential aspect of software development, and it is c...
research
01/14/2022

DapStep: Deep Assignee Prediction for Stack Trace Error rePresentation

The task of finding the best developer to fix a bug is called bug triage...
research
07/15/2023

Improving Trace Link Recommendation by Using Non-Isotropic Distances and Combinations

The existence of trace links between artifacts of the software developme...
research
02/28/2018

Orion+: Automated Problem Diagnosis in Computing Systems by Mining Metric Data

This work presents the suspicious code at a finer granularity of call st...

Please sign up or login with your details

Forgot password? Click here to reset