GIRT-Data: Sampling GitHub Issue Report Templates

03/16/2023
by   Nafiseh Nikeghbal, et al.
0

GitHub's issue reports provide developers with valuable information that is essential to the evolution of a software development project. Contributors can use these reports to perform software engineering tasks like submitting bugs, requesting features, and collaborating on ideas. In the initial versions of issue reports, there was no standard way of using them. As a result, the quality of issue reports varied widely. To improve the quality of issue reports, GitHub introduced issue report templates (IRTs), which pre-fill issue descriptions when a new issue is opened. An IRT usually contains greeting contributors, describing project guidelines, and collecting relevant information. However, despite of effectiveness of this feature which was introduced in 2016, only nearly 5 stars) utilize it. There are currently few articles on IRTs, and the available ones only consider a small number of repositories. In this work, we introduce GIRT-Data, the first and largest dataset of IRTs in both YAML and Markdown format. This dataset and its corresponding open-source crawler tool are intended to support research in this area and to encourage more developers to use IRTs in their repositories. The stable version of the dataset contains 1,084,300 repositories and 50,032 of them support IRTs. The stable version of the dataset and crawler is available here: https://github.com/kargaranamir/girt-data

READ FULL TEXT
research
04/04/2019

Bounties in Open Source Development on GitHub: A Case Study of Bountysource Bounties

Due to the voluntary nature of open source software, it can be hard to f...
research
12/20/2021

How Do Developers Deal with Security Issue Reports on GitHub?

Security issue reports are the primary means of informing development te...
research
10/21/2019

Trouble with the Curve: Predicting Future MLB Players Using Scouting Reports

In baseball, a scouting report profiles a player's characteristics and t...
research
03/31/2022

CatIss: An Intelligent Tool for Categorizing Issues Reports using Transformers

Users use Issue Tracking Systems to keep track and manage issue reports ...
research
12/20/2020

Predicting the Objective and Priority of Issue Reports in a Cross project Context

Proper documentation plays an important role in successful software mana...
research
06/06/2023

Using Screenshot Attachments in Issue Reports for Triaging

In previous work, we deployed IssueTAG, which uses the texts present in ...
research
10/04/2022

Multifaceted Hierarchical Report Identification for Non-Functional Bugs in Deep Learning Frameworks

Non-functional bugs (e.g., performance- or accuracy-related bugs) in Dee...

Please sign up or login with your details

Forgot password? Click here to reset