: a Python "smuggler" for constructing lightweight reproducible notebooks

11/23/2022
by   Paxton C. Fitzpatrick, et al.
0

Reproducibility is a core requirement of modern scientific research. For computational research, reproducibility means that code should produce the same results, even when run on different systems. A standard approach to ensuring reproducibility entails packaging a project's dependencies along with its primary code base. Existing solutions vary in how deeply these dependencies are specified, ranging from virtual environments, to containers, to virtual machines. Each of these existing solutions requires installing or setting up a system for running the desired code, increasing the complexity and time cost of sharing or engaging with reproducible science. Here, we propose a lighter-weight solution: the package. When used in combination with a notebook-based Python project, provides a mechanism for specifying (and automatically installing) the correct versions of the project's dependencies. The package further ensures that those packages and specific versions are used every time the notebook's code is executed. This enables researchers to share a complete reproducible copy of their code within a single Jupyter notebook file.

READ FULL TEXT
research
03/08/2023

RANG: Reconstructing reproducible R computational environments

A complete declarative description of the computational environment is o...
research
04/29/2022

Repro: An Open-Source Library for Improving the Reproducibility and Usability of Publicly Available Research Code

We introduce Repro, an open-source library which aims at improving the r...
research
03/04/2021

Restoring Execution Environments of Jupyter Notebooks

More than ninety percent of published Jupyter notebooks do not state dep...
research
09/09/2022

Computational reproducibility of Jupyter notebooks from biomedical publications

Jupyter notebooks allow to bundle executable code with its documentation...
research
08/03/2018

DataDeps.jl: Repeatable Data Setup for Replicable Data Science

We present DataDeps.jl: a julia package for the reproducible handling of...
research
10/05/2022

Be Prospective, Not Retrospective: A Philosophy for Advancing Reproducibility in Modern Biological Research

The ubiquity of computation in modern scientific research inflicts new c...
research
05/27/2019

DockerizeMe: Automatic Inference of Environment Dependencies for Python Code Snippets

Platforms like Stack Overflow and GitHub's gist system promote the shari...

Please sign up or login with your details

Forgot password? Click here to reset