Self-Consistent Models and Values

10/25/2021
by   Gregory Farquhar, et al.
6

Learned models of the environment provide reinforcement learning (RL) agents with flexible ways of making predictions about the environment. In particular, models enable planning, i.e. using more computation to improve value functions or policies, without requiring additional environment interactions. In this work, we investigate a way of augmenting model-based RL, by additionally encouraging a learned model and value function to be jointly self-consistent. Our approach differs from classic planning methods such as Dyna, which only update values to be consistent with the model. We propose multiple self-consistency updates, evaluate these in both tabular and function approximation settings, and find that, with appropriate choices, self-consistency helps both policy evaluation and control.

READ FULL TEXT
research
06/08/2020

Hallucinating Value: A Pitfall of Dyna-style Planning with Imperfect Environment Models

Dyna-style reinforcement learning (RL) agents improve sample efficiency ...
research
06/09/2021

Self-Paced Context Evaluation for Contextual Reinforcement Learning

Reinforcement learning (RL) has made a lot of advances for solving a sin...
research
12/19/2014

Grounding Hierarchical Reinforcement Learning Models for Knowledge Transfer

Methods of deep machine learning enable to to reuse low-level representa...
research
04/17/2021

Planning with Expectation Models for Control

In model-based reinforcement learning (MBRL), Wan et al. (2019) showed c...
research
06/11/2019

Learning Powerful Policies by Using Consistent Dynamics Model

Model-based Reinforcement Learning approaches have the promise of being ...
research
08/04/2022

Towards Augmented Microscopy with Reinforcement Learning-Enhanced Workflows

Here, we report a case study implementation of reinforcement learning (R...
research
11/18/2019

Gamma-Nets: Generalizing Value Estimation over Timescale

We present Γ-nets, a method for generalizing value function estimation o...

Please sign up or login with your details

Forgot password? Click here to reset