Automatic Classification of Bug Reports Based on Multiple Text Information and Reports' Intention

08/02/2022
by   Fanqi Meng, et al.
0

With the rapid growth of software scale and complexity, a large number of bug reports are submitted to the bug tracking system. In order to speed up defect repair, these reports need to be accurately classified so that they can be sent to the appropriate developers. However, the existing classification methods only use the text information of the bug report, which leads to their low performance. To solve the above problems, this paper proposes a new automatic classification method for bug reports. The innovation is that when categorizing bug reports, in addition to using the text information of the report, the intention of the report (i.e. suggestion or explanation) is also considered, thereby improving the performance of the classification. First, we collect bug reports from four ecosystems (Apache, Eclipse, Gentoo, Mozilla) and manually annotate them to construct an experimental data set. Then, we use Natural Language Processing technology to preprocess the data. On this basis, BERT and TF-IDF are used to extract the features of the intention and the multiple text information. Finally, the features are used to train the classifiers. The experimental result on five classifiers (including K-Nearest Neighbor, Naive Bayes, Logistic Regression, Support Vector Machine, and Random Forest) show that our proposed method achieves better performance and its F-Measure achieves from 87.3

READ FULL TEXT
research
03/03/2021

Root cause prediction based on bug reports

This paper proposes a supervised machine learning approach for predictin...
research
09/18/2017

Bug or Not? Bug Report Classification Using N-Gram IDF

Previous studies have found that a significant number of bug reports are...
research
09/28/2021

What to Prioritize? Natural Language Processing for the Development of a Modern Bug Tracking Solution in Hardware Development

Managing large numbers of incoming bug reports and finding the most crit...
research
10/31/2019

Human-centric Metric for Accelerating Pathology Reports Annotation

Pathology reports contain useful information such as the main involved o...
research
09/29/2018

Towards Better Summarizing Bug Reports with Crowdsourcing Elicited Attributes

Recent years have witnessed the growing demands for resolving numerous b...
research
07/09/2015

Data Mining of Causal Relations from Text: Analysing Maritime Accident Investigation Reports

Text mining is a process of extracting information of interest from text...
research
03/05/2019

Automatic Classification of Pathology Reports using TF-IDF Features

A Pathology report is arguably one of the most important documents in me...

Please sign up or login with your details

Forgot password? Click here to reset