Evaluating the Success of a Data Analysis

04/26/2019
by Stephanie C. Hicks, et al.

A fundamental problem in the practice and teaching of data science is how to evaluate the quality of a given data analysis, which is different from evaluating the science or question underlying the data analysis. Previously, we defined a set of principles for describing data analyses that can be used to create a data analysis and to characterize the variation between data analyses. Here, we introduce a metric of quality evaluation that we call the success of a data analysis, which is distinct from other potential metrics such as completeness, validity, or honesty. We define a successful data analysis as the matching of principles between the analyst and the audience for which the analysis is developed. In this paper, we propose a statistical model and general framework for evaluating the success of a data analysis. We argue that this framework can guide practicing data scientists and students in data science courses on how to build a successful data analysis.

research
03/18/2019

Elements and Principles of Data Analysis

The data revolution has led to an increased interest in the practice of ...
research
08/31/2023

In-class Data Analysis Replications: Teaching Students while Testing Science

Science is facing a reproducibility crisis. Previous work has proposed i...
research
01/02/2023

Science Platforms for Heliophysics Data Analysis

We recommend that NASA maintain and fund science platforms that enable i...
research
12/25/2018

A Variability-Aware Design Approach to the Data Analysis Modeling Process

The massive amount of current data has led to many different forms of da...
research
09/14/2018

The Generic Holdout: Preventing False-Discoveries in Adaptive Data Science

Adaptive data analysis has posed a challenge to science due to its abili...
research
10/15/2021

A Static Analysis Framework for Data Science Notebooks

Notebooks provide an interactive environment for programmers to develop ...
research
07/19/2019

Continuously Updated Data Analysis Systems

When doing data science, it's important to know what you're building. Th...
