A Systematic Mapping of Software Engineering Challenges: GHTorrent Case

03/24/2020
by   Abdulkadir Seker, et al.
0

Git is used as the distributed version control system for many open-source software projects. One Git-based service, GitHub, is the most common code hosting and repository service for open-source software projects. For researchers that study software engineering, the content that is hosted on these platforms provides much valuable data. There are some alternatives to get GitHub data such as GitHub Archive, GitHub API or GHTorrent. Among these options, GHTorrent is the most widely known and used GitHub dataset in the literature. Although there are some review studies about software engineering challenges across the GitHub platform, no review of GHTorrent dataset-specific research is available. In this study, the 172 studies that use GHTorrent as a data source were categorized within the scope of software engineering challenges and a systematic mapping study was carried out. Moreover, the pros and cons of the dataset have been indicated and the focused issues of the literature on and the open challenges have been noted.

READ FULL TEXT

page 5

page 6

page 7

page 8

page 9

page 10

page 11

page 12

research
06/08/2020

Summarising Big Data: Common GitHub Dataset for Software Engineering Challenges

In open-source software development environments; textual, numerical and...
research
04/05/2018

Metrics Dashboard: A Hosted Platform for Software Quality Metrics

There is an emerging consensus in the scientific software community that...
research
02/23/2023

Automatic Detecting Unethical Behavior in Open-source Software Projects

Given the rapid growth of Open-Source Software (OSS) projects, ethical c...
research
05/18/2023

Patterns in Docker Compose Multi-Container Orchestration

Software design patterns present general code solutions to common softwa...
research
07/13/2021

Promises and Perils of Inferring Personality on GitHub

Personality plays a pivotal role in our understanding of human actions a...
research
02/17/2022

QuerTCI: A Tool Integrating GitHub Issue Querying with Comment Classification

Issue tracking systems enable users and developers to comment on problem...

Please sign up or login with your details

Forgot password? Click here to reset