Pull Request Decision Explained: An Empirical Overview

by   Xunhui Zhang, et al.

Context: Pull-based development model is widely used in open source, leading the trends in distributed software development. One aspect which has garnered significant attention is studies on pull request decision - identifying factors for explanation. Objective: This study builds on a decade long research on pull request decision to explain it. We empirically investigate how factors influence pull request decision and scenarios that change the influence of factors. Method: We identify factors influencing pull request decision on GitHub through a systematic literature review and infer it by mining archival data. We collect a total of 3,347,937 pull requests with 95 features from 11,230 diverse projects on GitHub. Using this data, we explore the relations of the factors to each other and build mixed-effect logistic regression models to empirically explain pull request decision. Results: Our study shows that a small number of factors explain pull request decision with the integrator same or different from the submitter as the most important factor. We also noted that some factors are important only in special cases e.g., the percentage of failed builds is important for pull request decision when continuous integration is used.



There are no comments yet.


page 3

page 8

page 16

page 17

page 18


Pull Request Latency Explained: An Empirical Overview

Pull request latency evaluation is an essential application of effort ev...

On the impact of pull request decisions on future contributions

The pull-based development process has become prevalent on platforms suc...

Does Code Quality Affect Pull Request Acceptance? An empirical study

Background. Pull requests are a common practice for contributing and rev...

Do Small Code Changes Merge Faster? A Multi-Language Empirical Investigation

Code velocity, or the speed with which code changes are integrated into ...

Replication Can Improve Prior Results: A GitHub Study of Pull Request Acceptance

Crowdsourcing and data mining can be used to effectively reduce the effo...

Evaluating a bot detection model on git commit messages

Detecting the presence of bots in distributed software development activ...

Effect of Technical and Social Factors on Pull Request Quality for the NPM Ecosystem

Pull request (PR) based development, which is a norm for the social codi...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.