Hypothesis Formalization: Empirical Findings, Software Limitations, and Design Implications

04/06/2021
by   Eunice Jun, et al.
0

Data analysis requires translating higher level questions and hypotheses into computable statistical models. We present a mixed-methods study aimed at identifying the steps, considerations, and challenges involved in operationalizing hypotheses into statistical models, a process we refer to as hypothesis formalization. In a formative content analysis of research papers, we find that researchers highlight decomposing a hypothesis into sub-hypotheses, selecting proxy variables, and formulating statistical models based on data collection design as key steps. In a lab study, we find that analysts fixated on implementation and shaped their analysis to fit familiar approaches, even if sub-optimal. In an analysis of software tools, we find that tools provide inconsistent, low-level abstractions that may limit the statistical models analysts use to formalize hypotheses. Based on these observations, we characterize hypothesis formalization as a dual-search process balancing conceptual and statistical considerations constrained by data and computation, and discuss implications for future tools.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/24/2022

Post-clustering difference testing: valid inference and practical considerations

Clustering is part of unsupervised analysis methods that consist in grou...
research
11/21/2021

Confidences in Hypotheses

This article introduces a broadly-applicable new method of statistical a...
research
08/16/2018

From MVPs to pivots: a hypothesis-driven journey of two software startups

Software startups have emerged as an interesting multiperspective resear...
research
04/10/2019

Tea: A High-level Language and Runtime System for Automating Statistical Analysis

Though statistical analyses are centered on research questions and hypot...
research
04/29/2022

A Grammar for Hypothesis-Driven Visual Analysis

A hallmark of visual analytics is its ability to support users in transl...
research
07/27/2017

The Topology of Statistical Verifiability

Topological models of empirical and formal inquiry are increasingly prev...
research
10/08/2020

Statistical Models for the Analysis of Optimization Algorithms with Benchmark Functions

Frequentist statistical methods, such as hypothesis testing, are standar...

Please sign up or login with your details

Forgot password? Click here to reset