CatIss: An Intelligent Tool for Categorizing Issues Reports using Transformers

03/31/2022
by   Maliheh Izadi, et al.
0

Users use Issue Tracking Systems to keep track and manage issue reports in their repositories. An issue is a rich source of software information that contains different reports including a problem, a request for new features, or merely a question about the software product. As the number of these issues increases, it becomes harder to manage them manually. Thus, automatic approaches are proposed to help facilitate the management of issue reports. This paper describes CatIss, an automatic CATegorizer of ISSue reports which is built upon the Transformer-based pre-trained RoBERTa model. CatIss classifies issue reports into three main categories of Bug reports, Enhancement/feature requests, and Questions. First, the datasets provided for the NLBSE tool competition are cleaned and preprocessed. Then, the pre-trained RoBERTa model is fine-tuned on the preprocessed dataset. Evaluating CatIss on about 80 thousand issue reports from GitHub, indicates that it performs very well surpassing the competition baseline, TicketTagger, and achieving 87.2 F1-score (micro average). Additionally, as CatIss is trained on a wide set of repositories, it is a generic prediction model, hence applicable for any unseen software project or projects with little historical data. Scripts for cleaning the datasets, training CatIss, and evaluating the model are publicly available.

READ FULL TEXT
research
12/20/2020

Predicting the Objective and Priority of Issue Reports in a Cross project Context

Proper documentation plays an important role in successful software mana...
research
06/22/2022

iTiger: An Automatic Issue Title Generation Tool

In both commercial and open-source software, bug reports or issues are u...
research
02/12/2022

Automatic Issue Classifier: A Transfer Learning Framework for Classifying Issue Reports

Issue tracking systems are used in the software industry for the facilit...
research
05/03/2022

Predicting Issue Types with seBERT

Pre-trained transformer models are the current state-of-the-art for natu...
research
03/16/2023

GIRT-Data: Sampling GitHub Issue Report Templates

GitHub's issue reports provide developers with valuable information that...
research
08/31/2023

MaintainoMATE: A GitHub App for Intelligent Automation of Maintenance Activities

Software development projects rely on issue tracking systems at the core...
research
04/04/2019

"Won't We Fix this Issue?" Qualitative Characterization and Automated Identification of Wontfix Issues on GitHub

Addressing users requests in the form of bug reports and Github issues r...

Please sign up or login with your details

Forgot password? Click here to reset