A Reinforcement Learning Approach to Estimating Long-term Treatment Effects

10/14/2022
by   Ziyang Tang, et al.
0

Randomized experiments (a.k.a. A/B tests) are a powerful tool for estimating treatment effects, to inform decisions making in business, healthcare and other applications. In many problems, the treatment has a lasting effect that evolves over time. A limitation with randomized experiments is that they do not easily extend to measure long-term effects, since running long experiments is time-consuming and expensive. In this paper, we take a reinforcement learning (RL) approach that estimates the average reward in a Markov process. Motivated by real-world scenarios where the observed state transition is nonstationary, we develop a new algorithm for a class of nonstationary problems, and demonstrate promising results in two synthetic datasets and one online store dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/05/2020

A Reinforcement Learning Framework for Time-Dependent Causal Effects Evaluation in A/B Testing

A/B testing, or online experiment is a standard business strategy to com...
research
09/14/2023

Choosing a Proxy Metric from Past Experiments

In many randomized experiments, the treatment effect of the long-term me...
research
05/31/2018

Evaluating Reinforcement Learning Algorithms in Observational Health Settings

Much attention has been devoted recently to the development of machine l...
research
09/28/2021

Deep Reinforcement Learning with Adjustments

Deep reinforcement learning (RL) algorithms can learn complex policies t...
research
02/18/2021

Novelty and Primacy: A Long-Term Estimator for Online Experiments

Online experiments are the gold standard for evaluating impact on user e...
research
02/25/2022

Ensemble Method for Estimating Individualized Treatment Effects

In many medical and business applications, researchers are interested in...
research
05/06/2020

DTR Bandit: Learning to Make Response-Adaptive Decisions With Low Regret

Dynamic treatment regimes (DTRs) for are personalized, sequential treatm...

Please sign up or login with your details

Forgot password? Click here to reset