Bug or Not? Bug Report Classification Using N-Gram IDF

09/18/2017
by   Pannavat Terdchanakul, et al.
0

Previous studies have found that a significant number of bug reports are misclassified between bugs and non-bugs, and that manually classifying bug reports is a time-consuming task. To address this problem, we propose a bug reports classification model with N-gram IDF, a theoretical extension of Inverse Document Frequency (IDF) for handling words and phrases of any length. N-gram IDF enables us to extract key terms of any length from texts, these key terms can be used as the features to classify bug reports. We build classification models with logistic regression and random forest using features from N-gram IDF and topic modeling, which is widely used in various software engineering tasks. With a publicly available dataset, our results show that our N-gram IDF-based models have a superior performance than the topic-based models on all of the evaluated cases. Our models show promising results and have a potential to be extended to other software engineering tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/02/2022

Automatic Classification of Bug Reports Based on Multiple Text Information and Reports' Intention

With the rapid growth of software scale and complexity, a large number o...
research
04/26/2021

DABT: A Dependency-aware Bug Triaging Method

In software engineering practice, fixing a bug promptly reduces the asso...
research
03/29/2019

Train One Get One Free: Partially Supervised Neural Network for Bug Report Duplicate Detection and Clustering

Tracking user reported bugs requires considerable engineering effort in ...
research
04/27/2019

Sentiment Classification using N-gram IDF and Automated Machine Learning

We propose a sentiment classification method with a general machine lear...
research
12/27/2021

Evaluating Software User Feedback Classifiers on Unseen Apps, Datasets, and Metadata

Listening to user's requirements is crucial to building and maintaining ...
research
12/28/2021

Fast Changeset-based Bug Localization with BERT

Automatically localizing software bugs to the changesets that induced th...
research
08/21/2023

PrAIoritize: Learning to Prioritize Smart Contract Bugs and Vulnerabilities

Smart contract vulnerabilities and bugs have become a key concern for so...

Please sign up or login with your details

Forgot password? Click here to reset