Advantage Amplification in Slowly Evolving Latent-State Environments

05/29/2019
by Martin Mladenov, et al.

Latent-state environments with long horizons, such as those faced by recommender systems, pose significant challenges for reinforcement learning (RL). In this work, we identify and analyze several key hurdles for RL in such environments, including belief state error and small action advantage. We develop a general principle of advantage amplification that can overcome these hurdles through the use of temporal abstraction. We propose several aggregation methods and prove they induce amplification in certain settings. We also bound the loss in optimality incurred by our methods in environments where the latent state evolves slowly, and we demonstrate their performance empirically in a stylized user-modeling task.
