Principles for data analysis workflows

07/17/2020
by   Sara Stoudt, et al.
0

Traditional data science education often omits training on research workflows: the process that moves a scientific investigation from raw data to coherent research question to insightful contribution. In this paper, we elaborate basic principles of a reproducible data analysis workflow by defining three phases: the Exploratory, Refinement, and Polishing Phases. Each workflow phase is roughly centered around the audience to whom research decisions, methodologies, and results are being immediately communicated. Importantly, each phase can also give rise to a number of research products beyond traditional academic publications. Where relevant, we draw analogies between principles for data-intensive research workflows and established practice in software development. The guidance provided here is not intended to be a strict rulebook; rather, the suggestions for practices and tools to advance reproducible, sound data-intensive analysis may furnish support for both students and current professionals.

READ FULL TEXT
research
03/09/2021

Design Principles for Data Analysis

The data science revolution has led to an increased interest in the prac...
research
03/18/2019

Elements and Principles of Data Analysis

The data revolution has led to an increased interest in the practice of ...
research
01/19/2022

A Practical Approach of Actions for FAIRification Workflows

Since their proposal in 2016, the FAIR principles have been largely disc...
research
10/14/2019

code::proof: Prepare for most weather conditions

Computational tools for data analysis are being released daily on reposi...
research
12/23/2019

Teaching Responsible Data Science: Charting New Pedagogical Territory

Although numerous ethics courses are available, with many focusing speci...
research
04/24/2019

Organizing Network Management Logic with Circular Economy Principles

The traditional cycle of industrial products has been linear since its i...
research
04/10/2022

Iceberg Sensemaking: A Process Model for Critical Data Analysis and Visualization

We offer a new model of the sensemaking process for data science and vis...

Please sign up or login with your details

Forgot password? Click here to reset