Replicating Data Pipelines with GrimoireLab

05/05/2022
by Kalvin Eng, et al.

In this paper, we present our MSR Hackathon 2022 project, which replicates an existing Gitter study using GrimoireLab. We compare the previous study's pipeline with our GrimoireLab implementation in terms of speed, data consistency, organization, and the learning curve to get started. We believe our experience with GrimoireLab can help future researchers choose the right approach when implementing data pipelines over Gitter and GitHub data.
