DataDeps.jl: Repeatable Data Setup for Replicable Data Science

08/03/2018
by   Lyndon White, et al.
0

We present DataDeps.jl: a julia package for the reproducible handling of static datasets to enhance the repeatability of scripts used in the data and computational sciences. It is used to automate the data setup part of running software which accompanies a paper to replicate a result. This step is commonly done manually, which expends time and allows for confusion. This functionality is also useful for other packages which require data to function (e.g. a trained machine learning based model). DataDeps.jl simplifies extending research software by automatically managing the dependencies and makes it easier to run another author's code, thus enhancing the reproducibility of data science research.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/26/2021

MLDev: Data Science Experiment Automation and Reproducibility Software

In this paper we explore the challenges of automating experiments in dat...
research
04/23/2020

Human-Machine Collaboration for Democratizing Data Science

Everybody wants to analyse their data, but only few posses the data scie...
research
02/19/2022

Tools and Recommendations for Reproducible Teaching

It is recommended that teacher-scholars of data science adopt reproducib...
research
10/22/2019

How can AI Automate End-to-End Data Science?

Data science is labor-intensive and human experts are scarce but heavily...
research
11/23/2022

: a Python "smuggler" for constructing lightweight reproducible notebooks

Reproducibility is a core requirement of modern scientific research. For...
research
11/08/2022

Caching and Reproducibility: Making Data Science experiments faster and FAIRer

Small to medium-scale data science experiments often rely on research so...
research
10/15/2021

A Static Analysis Framework for Data Science Notebooks

Notebooks provide an interactive environment for programmers to develop ...

Please sign up or login with your details

Forgot password? Click here to reset