Robust Decision-Focused Learning for Reward Transfer

04/06/2023
by   Abhishek Sharma, et al.
0

Decision-focused (DF) model-based reinforcement learning has recently been introduced as a powerful algorithm which can focus on learning the MDP dynamics which are most relevant for obtaining high rewards. While this approach increases the performance of agents by focusing the learning towards optimizing for the reward directly, it does so by learning less accurate dynamics (from a MLE standpoint), and may thus be brittle to changes in the reward function. In this work, we develop the robust decision-focused (RDF) algorithm which leverages the non-identifiability of DF solutions to learn models which maximize expected returns while simultaneously learning models which are robust to changes in the reward function. We demonstrate on a variety of toy example and healthcare simulators that RDF significantly increases the robustness of DF to changes in the reward function, without decreasing the overall return the agent obtains.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/11/2019

What Can Learned Intrinsic Rewards Capture?

Reinforcement learning agents can include different components, such as ...
research
12/08/2021

Application of Deep Reinforcement Learning to Payment Fraud

The large variety of digital payment choices available to consumers toda...
research
01/29/2018

Learning the Reward Function for a Misspecified Model

In model-based reinforcement learning it is typical to treat the problem...
research
09/26/2020

Online Learning of Non-Markovian Reward Models

There are situations in which an agent should receive rewards only after...
research
07/24/2023

Contrastive Example-Based Control

While many real-world problems that might benefit from reinforcement lea...
research
11/26/2021

Learning Long-Term Reward Redistribution via Randomized Return Decomposition

Many practical applications of reinforcement learning require agents to ...
research
10/29/2021

Xi-Learning: Successor Feature Transfer Learning for General Reward Functions

Transfer in Reinforcement Learning aims to improve learning performance ...

Please sign up or login with your details

Forgot password? Click here to reset