Advaita: Bug Duplicity Detection System

01/24/2020
by   Amit Kumar, et al.
0

Bugs are prevalent in software development. To improve software quality, bugs are filed using a bug tracking system. Properties of a reported bug would consist of a headline, description, project, product, component that is affected by the bug and the severity of the bug. Duplicate bugs rate ( duplicate bugs) are in the range from single digit (1 to 9 (40 working on the project. Duplicate bugs range are between 9 the open source projects like Eclipse, Firefox etc. Detection of duplicity deals with identifying whether any two bugs convey the same meaning. This detection of duplicates helps in de-duplication. Detecting duplicate bugs help reduce triaging efforts and saves time for developers in fixing the issues. Traditional natural language processing techniques are less accurate in identifying similarity between sentences. Using the bug data present in a bug tracking system, various approaches were explored including several machine learning algorithms, to obtain a viable approach that can identify duplicate bugs, given a pair of sentences(i.e. the respective bug descriptions). This approach considers multiple sets of features viz. basic text statistical features, semantic features and contextual features. These features are extracted from the headline, description and component and are subsequently used to train a classification algorithm.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/01/2022

Is this bug severe? A text-cum-graph based model for bug severity prediction

Repositories of large software systems have become commonplace. This mas...
research
09/12/2023

PreciseBugCollector: Extensible, Executable and Precise Bug-fix Collection

Bug datasets are vital for enabling deep learning techniques to address ...
research
11/10/2020

Wayback Machine: Capturing the evolutionary behaviour of the bug dependency graph in open-source software systems

The issue tracking system (ITS) is a rich data source for data-driven de...
research
09/03/2019

A Bug or a Suggestion? An Automatic Way to Label Issues

More and more users and developers are using Issue Tracking Systems (ITS...
research
04/20/2022

BugListener: Identifying and Synthesizing Bug Reports from Collaborative Live Chats

In community-based software development, developers frequently rely on l...
research
12/15/2020

A Quantitative Study of Security Bug Fixes of GitHub Repositories

Software is prone to bugs and failures. Security bugs are those that exp...
research
10/07/2022

Understanding and Supporting Debugging Workflows in Multiverse Analysis

Multiverse analysis-a paradigm for statistical analysis that considers a...

Please sign up or login with your details

Forgot password? Click here to reset