Building the Collaboration Graph of Open-Source Software Ecosystem

by   Elena Lyulina, et al.

The Open-Source Software community has become the center of attention for many researchers, who are investigating various aspects of collaboration in this extremely large ecosystem. Due to its size, it is difficult to grasp whether or not it has structure, and if so, what it may be. Our hackathon project aims to facilitate the understanding of the developer collaboration structure and relationships among projects based on the bi-graph of what projects developers contribute to by providing an interactive collaboration graph of this ecosystem, using the data obtained from World of Code infrastructure. Our attempts to visualize the entirety of projects and developers were stymied by the inability of the layout and visualization tools to process the exceedingly large scale of the full graph. We used WoC to filter the nodes (developers and projects) and edges (developer contributions to a project) to reduce the scale of the graph that made it amenable to an interactive visualization and published the resulting visualizations. We plan to apply hierarchical approaches to be able to incorporate the entire data in the interactive visualizations and also to evaluate the utility of such visualizations for several tasks.



There are no comments yet.


page 2


On Developers' Personality in Large-scale Distributed Projects: The Case of the Apache Ecosystem

Large-scale distributed projects are typically the results of collective...

Visualization of Contributions to Open-Source Projects

We want to analyze visually, to what extend team members and external de...

Evolving Collaboration, Dependencies, and Use in the Rust Open Source Software Ecosystem

Open-source software (OSS) is widely spread in industry, research, and g...

Teddy: Automatic Recommendation of Pythonic Idiom Usage For Pull-Based Software Projects

Pythonic code is idiomatic code that follows guiding principles and prac...

World of Code: Enabling a Research Workflow for Mining and Analyzing the Universe of Open Source VCS data

Open source software (OSS) is essential for modern society and, while su...

A Preliminary Theory for Open Source Ecosystem Micro-economics

While there has been substantial empirical work identifying factors that...

PlotXY: a high quality plotting system for the Herschel Interactive Processing Environment (HIPE), and the astronomical community

The Herschel Interactive Processing Environment (HIPE) was developed by ...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.