Causality and Batch Reinforcement Learning: Complementary Approaches To Planning In Unknown Domains

06/03/2020
by James Bannon, et al.

Reinforcement learning (RL) algorithms have had tremendous success in online learning settings. However, these successes have relied on low-stakes interactions between the algorithmic agent and its environment. In many settings where RL could be of use, such as health care and autonomous driving, the mistakes made by most online RL algorithms during early training come with unacceptable costs. These settings require reinforcement learning algorithms that can operate in the so-called batch setting, where the algorithm must learn from a fixed, finite set of data generated by some (possibly unknown) policy. Evaluating policies different from the one that collected the data is called off-policy evaluation, and it naturally poses counterfactual questions. In this project we show how off-policy evaluation and the estimation of treatment effects in causal inference are two approaches to the same problem, and we compare recent progress in these two areas.
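
The link between off-policy evaluation and treatment-effect estimation can be illustrated with a minimal one-step sketch. This is not taken from the paper: it assumes a contextual-bandit setting (a single decision per record), a known logging policy, and synthetic data. All names (`ipw_value`, `simulate_logs`, `always_treat`, `never_treat`) are hypothetical. The same inverse-propensity-weighted estimator evaluates a target policy from logged data and, when applied to the "always treat" and "never treat" policies, gives an estimate of the average treatment effect.

```python
import random


def ipw_value(logged, target_policy):
    """Inverse-propensity-weighted estimate of a target policy's value.

    logged: list of (context, action, reward, behavior_prob) tuples, where
            behavior_prob is the probability the logging (behavior) policy
            assigned to the action it actually took.
    target_policy: function (context, action) -> probability of that action
            under the policy being evaluated.
    """
    total = 0.0
    for context, action, reward, behavior_prob in logged:
        # Reweight each logged reward by how much more (or less) likely
        # the target policy is to take the logged action.
        weight = target_policy(context, action) / behavior_prob
        total += weight * reward
    return total / len(logged)


def simulate_logs(n, seed=0):
    """Synthetic batch data (an assumption made for illustration only)."""
    rng = random.Random(seed)
    logs = []
    for _ in range(n):
        context = rng.random() < 0.5           # binary covariate
        p_treat = 0.8 if context else 0.3      # behavior policy depends on context
        action = 1 if rng.random() < p_treat else 0
        behavior_prob = p_treat if action == 1 else 1.0 - p_treat
        # Treatment adds 1.0 to the reward; the context adds 0.5 and also
        # influences the action, so it confounds naive comparisons.
        reward = 1.0 * action + (0.5 if context else 0.0) + rng.gauss(0.0, 0.1)
        logs.append((context, action, reward, behavior_prob))
    return logs


def always_treat(context, action):
    return 1.0 if action == 1 else 0.0


def never_treat(context, action):
    return 1.0 if action == 0 else 0.0


logs = simulate_logs(20000)
v_treat = ipw_value(logs, always_treat)    # off-policy value of "always treat"
v_control = ipw_value(logs, never_treat)   # off-policy value of "never treat"
ate_estimate = v_treat - v_control         # IPW average treatment effect

# Naive difference of observed means, which ignores the confounding context.
treated = [r for _, a, r, _ in logs if a == 1]
control = [r for _, a, r, _ in logs if a == 0]
naive_difference = sum(treated) / len(treated) - sum(control) / len(control)
```

By construction of the synthetic data, the true treatment effect is 1.0, so `ate_estimate` should land close to that value, while `naive_difference` is biased upward because treated records are more likely to have the favorable context. Sequential (multi-step) problems need per-trajectory or per-decision weights, which this sketch does not cover.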


