Counterfactual Control for Free from Generative Models

02/22/2017
by Nicholas Guttenberg, et al.

We introduce a method by which a generative model that learns the joint distribution between actions and future states can be used to automatically infer a control scheme for any desired reward function, which may be altered on the fly without retraining the model. In this method, the problem of action selection is reduced to one of gradient descent on the latent space of the generative model, with the model itself providing the means of evaluating outcomes and finding the gradient, much like how the reward network in Deep Q-Networks (DQN) provides gradient information for the action generator. Unlike DQN or Actor-Critic, which are conditional models for a specific reward, a generative model of the full joint distribution permits the reward to be changed on the fly. In addition, the generated futures can be inspected to gain insight into what the network 'thinks' will happen, and into what went wrong when the outcomes deviate from prediction.
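The idea of selecting actions by gradient descent on the latent space can be illustrated with a toy sketch. Everything here is an assumption made for illustration and is not taken from the paper: `decode` is a hand-written stand-in for a trained generative model, the latent space is two-dimensional, and the gradient is estimated by central finite differences rather than by backpropagation through a network. The loss is the negative of the reward, so descending it raises the reward.

```python
import math

def decode(z):
    """Hypothetical stand-in for a trained generative model.

    Maps a latent vector z to a jointly generated (action, future_state) pair.
    A real model would be a learned network; this is a fixed toy function.
    """
    action = math.tanh(z[0])
    # The predicted future depends on the action plus a second latent factor.
    future_state = action + 0.1 * math.tanh(z[1])
    return action, future_state

def make_loss(target):
    """Build a loss (negative reward) that prefers futures close to `target`."""
    def loss(future_state):
        return (future_state - target) ** 2
    return loss

def select_action(loss, steps=200, lr=0.5, eps=1e-4):
    """Gradient descent on the latent vector; the model itself is never retrained."""
    z = [0.0, 0.0]
    for _ in range(steps):
        grad = []
        for i in range(len(z)):
            z_plus = list(z)
            z_plus[i] += eps
            z_minus = list(z)
            z_minus[i] -= eps
            # Central finite difference (assumed here in place of backprop).
            diff = loss(decode(z_plus)[1]) - loss(decode(z_minus)[1])
            grad.append(diff / (2 * eps))
        z = [zi - lr * gi for zi, gi in zip(z, grad)]
    return decode(z)

# The same fixed model serves two different reward functions.
action_a, future_a = select_action(make_loss(0.5))
action_b, future_b = select_action(make_loss(-0.3))
```

Because the reward enters only through the `loss` argument, switching objectives means calling `select_action` again with a different loss; the decoded `future_state` can also be inspected directly to see what the model predicts for the chosen action.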
