Large-Scale Online Experimentation with Quantile Metrics

03/20/2019
by   Min Liu, et al.
0

Online experimentation (or A/B testing) has been widely adopted in industry as the gold standard for measuring product impacts. Despite the wide adoption, few literatures discuss A/B testing with quantile metrics. Quantile metrics, such as 90th percentile page load time, are crucial to A/B testing as many key performance metrics including site speed and service latency are defined as quantiles. However, with LinkedIn's data size, quantile metric A/B testing is extremely challenging because there is no statistically valid and scalable variance estimator for the quantile of dependent samples: the bootstrap estimator is statistically valid, but takes days to compute; the standard asymptotic variance estimate is scalable but results in order-of-magnitude underestimation. In this paper, we present a statistically valid and scalable methodology for A/B testing with quantiles that is fully generalizable to other A/B testing platforms. It achieves over 500 times speed up compared to bootstrap and has only 2% chance to differ from bootstrap estimates. Beyond methodology, we also share the implementation of a data pipeline using this methodology and insights on pipeline optimization.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/29/2020

CONQ: CONtinuous Quantile Treatment Effects for Large-Scale Online Controlled Experiments

In many industry settings, online controlled experimentation (A/B test) ...
research
03/06/2023

Quantile-Quantile Methodology – Detailed Results

The linear quantile-quantile relationship provides an easy-to-implement ...
research
09/12/2019

Fast Algorithms for the Quantile Regression Process

The widespread use of quantile regression methods depends crucially on t...
research
12/27/2018

Quantile Treatment Effects and Bootstrap Inference under Covariate-Adaptive Randomization

This paper studies the estimation and inference of the quantile treatmen...
research
11/29/2020

How to Measure Your App: A Couple of Pitfalls and Remedies in Measuring App Performance in Online Controlled Experiments

Effectively measuring, understanding, and improving mobile app performan...
research
01/15/2019

Quantile Tracking in Dynamically Varying Data Streams Using a Generalized Exponentially Weighted Average of Observations

The Exponentially Weighted Average (EWA) of observations is known to be ...
research
12/20/2022

Probabilistic quantile factor analysis

This paper extends quantile factor analysis to a probabilistic variant t...

Please sign up or login with your details

Forgot password? Click here to reset