Online Testing of Subgroup Treatment Effects Based on Value Difference

09/02/2021
by   Miao Yu, et al.
0

Online A/B testing plays a critical role in the high-tech industry to guide product development and accelerate innovation. It performs a null hypothesis statistical test to determine which variant is better. However, a typical A/B test presents two problems: (i) a fixed-horizon framework inflates the false-positive errors under continuous monitoring; (ii) the homogeneous effects assumption fails to identify a subgroup with a beneficial treatment effect. In this paper, we propose a sequential test for subgroup treatment effects based on value difference, named SUBTLE, to address these two problems simultaneously. The SUBTLE allows the experimenters to "peek" at the results during the experiment without harming the statistical guarantees. It assumes heterogeneous treatment effects and aims to test if some subgroup of the population will benefit from the investigative treatment. If the testing result indicates the existence of such a subgroup, a subgroup will be identified using a readily available estimated optimal treatment rule. We examine the empirical performance of our proposed test on both simulations and a real dataset. The results show that the SUBTLE has high detection power with controlled type I error at any time, is more robust to noise covariates, and can achieve early stopping compared with the corresponding fixed-horizon test.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/09/2020

A New Framework for Online Testing of Heterogeneous Treatment Effect

We propose a new framework for online testing of heterogeneous treatment...
research
11/06/2021

An Online Sequential Test for Qualitative Treatment Effects

Tech companies (e.g., Google or Facebook) often use randomized online ex...
research
02/22/2021

Interactive identification of individuals with positive treatment effect while controlling false discoveries

Out of the participants in a randomized experiment with anticipated hete...
research
09/04/2020

Instrument Validity for Heterogeneous Causal Effects

This paper provides a general framework for testing instrument validity ...
research
06/20/2023

Should I Stop or Should I Go: Early Stopping with Heterogeneous Populations

Randomized experiments often need to be stopped prematurely due to the t...
research
05/22/2019

Measuring Average Treatment Effect from Heavy-tailed Data

Heavy-tailed metrics are common and often critical to product evaluation...

Please sign up or login with your details

Forgot password? Click here to reset