Toward Enabling Reproducibility for Data-Intensive Research using the Whole Tale Platform

05/12/2020
by   Kyle Chard, et al.
0

Whole Tale http://wholetale.org is a web-based, open-source platform for reproducible research supporting the creation, sharing, execution, and verification of "Tales" for the scientific research community. Tales are executable research objects that capture the code, data, and environment along with narrative and workflow information needed to re-create computational results from scientific studies. Creating reproducible research objects that enable reproducibility, transparency, and re-execution for computational experiments requiring significant compute resources or utilizing massive data is an especially challenging open problem. We describe opportunities, challenges, and solutions to facilitating reproducibility for data- and compute-intensive research, that we call "Tales at Scale," using the Whole Tale computing platform. We highlight challenges and solutions in frontend responsiveness needs, gaps in current middleware design and implementation, network restrictions, containerization, and data access. Finally, we discuss challenges in packaging computational experiment implementations for portable data-intensive Tales and outline future work.

READ FULL TEXT

page 5

page 6

research
05/06/2020

Advancing computational reproducibility in the Dataverse data repository platform

Recent reproducibility case studies have raised concerns showing that mu...
research
06/29/2023

A Backend Platform for Supporting the Reproducibility of Computational Experiments

In recent years, the research community has raised serious questions abo...
research
06/03/2023

brainlife.io: A decentralized and open source cloud platform to support neuroscience research

Neuroscience research has expanded dramatically over the past 30 years b...
research
05/01/2018

Computing Environments for Reproducibility: Capturing the "Whole Tale"

The act of sharing scientific knowledge is rapidly evolving away from tr...
research
03/24/2021

SCHeMa: Scheduling Scientific Containers on a Cluster of Heterogeneous Machines

In the era of data-driven science, conducting computational experiments ...
research
05/31/2022

Computational Reproducibility Within Prognostics and Health Management

Scientific research frequently involves the use of computational tools a...
research
03/03/2022

SIERRA: A Modular Framework for Research Automation

Modern intelligent systems researchers employ the scientific method: the...

Please sign up or login with your details

Forgot password? Click here to reset