Design Principles for Data Analysis

03/09/2021
by   Lucy D'Agostino McGowan, et al.
0

The data science revolution has led to an increased interest in the practice of data analysis. While much has been written about statistical thinking, a complementary form of thinking that appears in the practice of data analysis is design thinking – the problem-solving process to understand the people for whom a product is being designed. For a given problem, there can be significant or subtle differences in how a data analyst (or producer of a data analysis) constructs, creates, or designs a data analysis, including differences in the choice of methods, tooling, and workflow. These choices can affect the data analysis products themselves and the experience of the consumer of the data analysis. Therefore, the role of a producer can be thought of as designing the data analysis with a set of design principles. Here, we introduce design principles for data analysis and describe how they can be mapped to data analyses in a quantitative, objective and informative manner. We also provide empirical evidence of variation of principles within and between both producers and consumers of data analyses. Our work leads to two insights: it suggests a formal mechanism to describe data analyses based on the design principles for data analysis, and it provides a framework to teach students how to build data analyses using formal design principles.

READ FULL TEXT

page 20

page 24

page 25

research
03/18/2019

Elements and Principles of Data Analysis

The data revolution has led to an increased interest in the practice of ...
research
07/17/2020

Principles for data analysis workflows

Traditional data science education often omits training on research work...
research
03/22/2021

New Perspectives on Centering

Data matrix centering is an ever-present yet under-examined aspect of da...
research
01/23/2019

Three principles of data science: predictability, computability, and stability (PCS)

We propose the predictability, computability, and stability (PCS) framew...
research
12/25/2018

A Variability-Aware Design Approach to the Data Analysis Modeling Process

The massive amount of current data has led to many different forms of da...
research
02/14/2019

The AtLarge Vision on the Design of Distributed Systems and Ecosystems

High-quality designs of distributed systems and services are essential f...
research
07/10/2020

Boba: Authoring and Visualizing Multiverse Analyses

Multiverse analysis is an approach to data analysis in which all "reason...

Please sign up or login with your details

Forgot password? Click here to reset