
LAGOON: An Analysis Tool for Open Source Communities

by Sourya Dey, et al.
Galois, Inc.

This paper presents LAGOON, an open source platform for understanding the complex ecosystems of Open Source Software (OSS) communities. The platform currently uses spatiotemporal graphs to store and investigate the artifacts produced by these communities and to help analysts identify bad actors who might compromise an OSS project's security. LAGOON ingests artifacts from several common sources, including source code repositories, issue trackers, mailing lists, and content scraped from project websites. Ingestion uses a modular architecture that supports incremental updates from data sources and provides a generic identity fusion process that can recognize the same community members across disparate accounts. A user interface is provided for visualizing and exploring an OSS project's complete sociotechnical graph, and scripts are provided for applying machine learning to identify patterns within the data. While the current focus is on identifying bad actors in the Python community, the platform's reusability makes it easily extensible with new data and analyses, paving the way for LAGOON to become a comprehensive means of assessing various OSS-based projects and their communities.
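
To make the idea of identity fusion concrete, the following is a minimal sketch, not LAGOON's actual implementation. It assumes each account is a plain dictionary with hypothetical `source`, `handle`, `name`, and `email` fields, and it treats two accounts as the same person whenever they share a case-insensitive name or email. The function name `fuse_identities` and the matching rule are illustrative assumptions; the abstract does not describe how the platform's fusion process works.

```python
from collections import defaultdict


def fuse_identities(accounts):
    """Group accounts that share a normalized name or email.

    Illustrative only: the field names and the matching rule are
    assumptions, not the fusion process used by LAGOON.
    """
    # Union-find structure: parent[i] points toward the group root of account i.
    parent = list(range(len(accounts)))

    def find(i):
        # Follow parent links to the root, compressing the path as we go.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    def union(i, j):
        ri, rj = find(i), find(j)
        if ri != rj:
            # Keep the smaller index as the root so results are deterministic.
            parent[max(ri, rj)] = min(ri, rj)

    # Map each normalized (field, value) pair to the first account that used it.
    first_seen = {}
    for idx, acct in enumerate(accounts):
        for field in ("name", "email"):
            value = acct.get(field)
            if not value:
                continue
            key = (field, value.strip().lower())
            if key in first_seen:
                union(idx, first_seen[key])
            else:
                first_seen[key] = idx

    # Collect "source:handle" labels per fused identity.
    groups = defaultdict(list)
    for idx, acct in enumerate(accounts):
        groups[find(idx)].append(acct["source"] + ":" + acct["handle"])
    return sorted(groups.values())
```

Because matching is transitive, an issue-tracker account that shares only a name with a git account is fused with a mailing-list account that shares only an email with that same git account. A real system would likely need fuzzier matching and safeguards against common names, which this sketch deliberately omits.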



