A fundamental problem of hypothesis testing with finite inventory in e-commerce

06/10/2020
by   Dennis Bohle, et al.
0

In this paper, we draw attention to a problem that is often overlooked or ignored by companies practicing hypothesis testing (A/B testing) in online environments. We show that conducting experiments on limited inventory that is shared between variants in the experiment can lead to high false positive rates since the core assumption of independence between the groups is violated. We provide a detailed analysis of the problem in a simplified setting whose parameters are informed by realistic scenarios. The setting we consider is a 2-dimensional random walk in a semi-infinite strip. It is rich enough to take a finite inventory into account, but is at the same time simple enough to allow for a closed form of the false-positive probability. We prove that high false-positive rates can occur, and develop tools that are suitable to help design adequate tests in follow-up work. Our results also show that high false-negative rates may occur. The proofs rely on a functional limit theorem for the 2-dimensional random walk in a semi-infinite strip.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/26/2017

A Flexible Framework for Hypothesis Testing in High-dimensions

Hypothesis testing in the linear regression model is a fundamental stati...
research
03/24/2022

Methods for Large-scale Single Mediator Hypothesis Testing: Possible Choices and Comparisons

Mediation hypothesis testing for a large number of mediators is challeng...
research
06/16/2022

Estimating the lifetime risk of a false positive screening test result

False positive results in screening tests have potentially severe psycho...
research
03/26/2019

Non-asymptotic error controlled sparse high dimensional precision matrix estimation

Estimation of a high dimensional precision matrix is a critical problem ...
research
05/30/2023

Identifying the Complete Correlation Structure in Large-Scale High-Dimensional Data Sets with Local False Discovery Rates

The identification of the dependent components in multiple data sets is ...
research
11/23/2020

The Bloom Clock for Causality Testing

Testing for causality between events in distributed executions is a fund...
research
01/13/2020

Breaking hypothesis testing for failure rates

We describe the utility of point processes and failure rates and the mos...

Please sign up or login with your details

Forgot password? Click here to reset