Community Formation and Detection on GitHub Collaboration Networks

09/23/2021
by   Behnaz Moradi-Jamei, et al.
0

This paper studies community formation in OSS collaboration networks. While most current work examines the emergence of small-scale OSS projects, our approach draws on a large-scale historical dataset of 1.8 million GitHub users and their repository contributions. OSS collaborations are characterized by small groups of users that work closely together, leading to the presence of communities defined by short cycles in the underlying network structure. To understand the impact of this phenomenon, we apply a pre-processing step that accounts for the cyclic network structure by using Renewal-Nonbacktracking Random Walks (RNBRW) and the strength of pairwise collaborations before implementing the Louvain method to identify communities within the network. Equipping Louvain with RNBRW and the contribution strength provides a more assertive approach for detecting small-scale teams and reveals nontrivial differences in community detection such as users tendencies toward preferential attachment to more established collaboration communities. Using this method, we also identify key factors that affect community formation, including the effect of users location and primary programming language, which was determined using a comparative method of contribution activities. Overall, this paper offers several promising methodological insights for both open-source software experts and network scholars interested in studying team formation.

READ FULL TEXT
research
10/02/2019

A new method for quantifying network cyclic structure to improve community detection

A distinguishing property of communities in networks is that cycles are ...
research
08/06/2020

Gathering GitHub OSS Requirements from Q A Community: an Empirical Study

Cross-community collaboration can exploit the expertise and knowledges o...
research
04/21/2023

Constructing Temporal Networks of OSS Programming Language Ecosystems

One of the primary factors that encourage developers to contribute to op...
research
03/11/2023

Prospecting Community Development Strength based on Economic Graph: From Categorization to Scoring

Recent years have witnessed a growing number of researches on community ...
research
06/02/2020

Analyzing programming languages by community characteristics on Github and StackOverflow

The choice of programming language is a very important decision as it no...
research
09/14/2023

Using network metrics to explore the community structure that underlies movement patterns

This work aims to explore the community structure of Santiago de Chile b...
research
10/13/2021

Diversity of Skills and Collective Intelligence in GitHub

A common assumption suggests that individuals tend to work with others w...

Please sign up or login with your details

Forgot password? Click here to reset