Boba: Authoring and Visualizing Multiverse Analyses

07/10/2020
by Yang Liu, et al.

Multiverse analysis is an approach to data analysis in which all "reasonable" analytic decisions are evaluated in parallel and interpreted collectively, in order to foster robustness and transparency. However, specifying a multiverse is demanding because analysts must manage myriad variants from a cross-product of analytic decisions, and the results require nuanced interpretation. We contribute Boba: an integrated domain-specific language (DSL) and visual analysis system for authoring and reviewing multiverse analyses. With the Boba DSL, analysts write the shared portion of analysis code only once, alongside local variations defining alternative decisions, from which the compiler generates a multiplex of scripts representing all possible analysis paths. The Boba Visualizer provides linked views of model results and the multiverse decision space to enable rapid, systematic assessment of consequential decisions and robustness, including sampling uncertainty and model fit. We demonstrate Boba's utility through two data analysis case studies, and reflect on challenges and design opportunities for multiverse analysis software.
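The idea of writing shared analysis code once and expanding local variations into every analysis path can be illustrated with a minimal sketch. This is not Boba's actual syntax or compiler: the `{name}` placeholder format, the `TEMPLATE` and `DECISIONS` contents, and the `expand` helper are all hypothetical, chosen only to show how a cross-product of analytic decisions yields one script per "universe".

```python
from itertools import product

# Hypothetical template: the shared analysis code is written once, and
# {name} marks a decision point. This is NOT Boba's actual syntax.
TEMPLATE = """\
df = load_data()
df = df[df.rt < {cutoff}]
model = fit(df, family="{family}")
"""

# Hypothetical decisions, each listing its alternative options.
DECISIONS = {
    "cutoff": [2.0, 2.5, 3.0],
    "family": ["gaussian", "lognormal"],
}


def expand(template, decisions):
    """Return one (choices, script) pair per combination of options."""
    names = list(decisions)
    universes = []
    for combo in product(*(decisions[n] for n in names)):
        choices = dict(zip(names, combo))
        universes.append((choices, template.format(**choices)))
    return universes


universes = expand(TEMPLATE, DECISIONS)
print(len(universes))  # 3 cutoffs x 2 families = 6 scripts
```

Even in this toy form, the number of generated scripts grows multiplicatively with each added decision, which is why the abstract stresses both tooling for authoring and linked views for interpreting the results.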

research
01/09/2019

Decision-Making Under Uncertainty in Research Synthesis: Designing for the Garden of Forking Paths

To make evidence-based recommendations to decision-makers, researchers c...
research
06/10/2022

Smallset Timelines: A Visual Representation of Data Preprocessing Decisions

Data preprocessing is a crucial stage in the data analysis pipeline, wit...
research
10/30/2019

Paths Explored, Paths Omitted, Paths Obscured: Decision Points & Selective Reporting in End-to-End Data Analysis

Drawing reliable inferences from data involves many, sometimes arbitrary...
research
03/09/2021

Design Principles for Data Analysis

The data science revolution has led to an increased interest in the prac...
research
09/15/2023

Modeling Data Analytic Iteration With Probabilistic Outcome Sets

In 1977 John Tukey described how in exploratory data analysis, data anal...
