When Provenance Aids and Complicates Reproducibility Judgments

08/14/2023
by David Koop, et al.

It is well established that the provenance of a scientific result is important, sometimes more important than the result itself. For computational analyses that involve visualization, this provenance may include the steps taken to generate visualizations from raw data. Specifically, data provenance tracks the lineage of data, and process provenance tracks the steps executed. We argue that the utility of computational provenance may not be as clear-cut as we might like. One common use of provenance is to reproduce the original result. In visualization, however, the goal is often to communicate results to a user or viewer, so the insights obtained are ultimately what matter most. Viewers can miss important changes or react to unimportant ones. Here, interaction provenance, which tracks a user's actions with a visualization, or insight provenance, which tracks the decision-making process, can help capture what happened but do not remove these issues. We present scenarios where provenance affects reproducibility in different ways, and we explore how provenance and visualizations can be better related.
