A Model-Free and Finite-Population-Exact Framework for Randomized Experiments Subject to Outcome Misclassification via Integer Programming

01/09/2022
by   Siyu Heng, et al.
0

Randomized experiments (trials) are the gold standard for making causal inferences because randomization removes systematic confounding and the need for assuming any data-generating (super-population) models. However, outcome misclassification (e.g. measurement error or reporting bias in binary outcomes) often exists in practice and even a few misclassified outcomes may distort a causal conclusion drawn from a randomized experiment. All existing approaches to outcome misclassification rely on some data-generating model and therefore may not be applicable to randomized experiments without additional strong assumptions. We propose a model-free and finite-population-exact framework for randomized experiments subject to outcome misclassification, which does not require adding any additional assumptions to a randomized experiment. A central quantity in our framework is "warning accuracy," defined as the threshold such that the causal conclusion drawn from the measured outcomes may differ from that based on the true outcomes if the accuracy of the measured outcomes did not surpass that threshold. We show how learning the warning accuracy and related information and a dual concept can benefit the design, analysis, and validation of a randomized experiment. We show that the warning accuracy can be computed efficiently (even for large datasets) by adaptively reformulating an integer quadratically constrained linear program with respect to the randomization design. Our framework covers both Fisher's sharp null and Neyman's weak null, works for a wide range of randomization designs, and can also be applied to observational studies adopting randomization-based inference. We apply our framework to a large randomized clinical trial of the prevention of prostate cancer.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/13/2017

Sharpening randomization-based causal inference for 2^2 factorial designs with binary outcomes

In medical research, a scenario often entertained is randomized controll...
research
06/21/2023

Optimal allocation of sample size for randomization-based inference from 2^K factorial designs

Optimizing the allocation of units into treatment groups can help resear...
research
07/06/2021

Randomization-based Test for Censored Outcomes: A New Look at the Logrank Test

Two-sample tests have been one of the most classical topics in statistic...
research
03/12/2018

On finite-population Bayesian inferences for 2^K factorial designs with binary outcomes

Inspired by the pioneering work of Rubin (1978), we employ the potential...
research
03/22/2020

Panel Experiments and Dynamic Causal Effects: A Finite Population Perspective

In panel experiments, we randomly expose multiple units to different tre...
research
08/19/2021

Robust Designs for Prospective Randomized Trials Surveying Sensitive Topics

We consider the problem of designing a prospective randomized trial in w...
research
08/26/2019

Estimating Malaria Vaccine Efficacy in the Absence of a Gold Standard Case Definition: Mendelian Factorial Design

Accurate estimates of malaria vaccine efficacy require a reliable defini...

Please sign up or login with your details

Forgot password? Click here to reset