Online Experimentation with Surrogate Metrics: Guidelines and a Case Study

06/02/2021
by Weitao Duan, et al.

A/B tests have been widely adopted across industries as the gold standard for guiding decision making. However, the long-term true north metrics we ultimately want to drive through A/B tests may take a long time to mature. In these situations, a surrogate metric that predicts the long-term metric is often used instead to conclude whether the treatment is effective. However, because the surrogate rarely predicts the true north perfectly, a regular A/B test based on surrogate metrics tends to have a high false positive rate, and the treatment variant deemed favorable by the test may not be the winning one. In this paper, we discuss how to adjust the A/B testing comparison to ensure experiment results are trustworthy. We also provide practical guidelines on choosing good surrogate metrics. To provide a concrete example of how to leverage surrogate metrics for fast decision making, we present a case study on developing and evaluating the predicted confirmed hire surrogate metric in the LinkedIn job marketplace.

research
09/14/2023

Choosing a Proxy Metric from Past Experiments

In many randomized experiments, the treatment effect of the long-term me...
research
04/19/2023

Optimizing Carbon Storage Operations for Long-Term Safety

To combat global warming and mitigate the risks associated with climate ...
research
12/02/2020

Doubly-robust evaluation of high-dimensional surrogate markers

When evaluating the effectiveness of a treatment, policy, or interventio...
research
01/29/2022

Composing a surrogate observation operator for sequential data assimilation

In data assimilation, state estimation is not straightforward when the o...
research
05/29/2019

Calibrated Surrogate Maximization of Linear-fractional Utility in Binary Classification

Complex classification performance metrics such as the F_β-measure and J...
research
07/03/2023

Pareto optimal proxy metrics

North star metrics and online experimentation play a central role in how...
research
05/16/2023

Toward Falsifying Causal Graphs Using a Permutation-Based Test

Understanding the causal relationships among the variables of a system i...
