Denoised MDPs: Learning World Models Better Than the World Itself

06/30/2022
by Tongzhou Wang, et al.

The ability to separate signal from noise, and to reason with clean abstractions, is critical to intelligence. With this ability, humans can efficiently perform real-world tasks without considering all possible nuisance factors. How can artificial agents do the same? What kind of information can agents safely discard as noise? In this work, we categorize information in the wild into four types based on controllability and relation to reward, and formulate useful information as that which is both controllable and reward-relevant. This framework clarifies the kinds of information removed by various prior works on representation learning in reinforcement learning (RL), and leads to our proposed approach of learning a Denoised MDP that explicitly factors out certain noise distractors. Extensive experiments on variants of DeepMind Control Suite and RoboDesk demonstrate superior performance of our denoised world model over using raw observations alone, and over prior works, across policy optimization control tasks as well as the non-control task of joint position regression.
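The two-axis categorization described in the abstract can be illustrated with a small sketch. This is not the paper's method (the Denoised MDP learns such a factorization from data); it only shows how the two binary properties, controllability and reward relevance, yield four information types, with only one of them counted as useful. The `Factor` class, the helper functions, and the example factors are all hypothetical and chosen purely for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Factor:
    """A hypothetical piece of information in an observation."""
    name: str
    controllable: bool      # can the agent's actions influence it?
    reward_relevant: bool   # does it affect the reward?


def category(f: Factor) -> str:
    """Return one of the four types induced by the two binary axes."""
    ctrl = "controllable" if f.controllable else "uncontrollable"
    rew = "reward-relevant" if f.reward_relevant else "reward-irrelevant"
    return f"{ctrl}, {rew}"


def is_useful(f: Factor) -> bool:
    """Useful information is both controllable and reward-relevant."""
    return f.controllable and f.reward_relevant


# Illustrative factors only; these labels are assumptions, not from the paper.
factors = [
    Factor("joint_position", controllable=True, reward_relevant=True),
    Factor("agent_movable_decoration", controllable=True, reward_relevant=False),
    Factor("external_reward_noise", controllable=False, reward_relevant=True),
    Factor("background_video", controllable=False, reward_relevant=False),
]

kept = [f.name for f in factors if is_useful(f)]
discarded = [f.name for f in factors if not is_useful(f)]

for f in factors:
    print(f"{f.name}: {category(f)} -> {'keep' if is_useful(f) else 'discard'}")
```

In this toy setting the labels are given by hand; the point of the abstract is that a learned world model should recover such a separation without being told which factors are which.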


research
05/11/2022

A State-Distribution Matching Approach to Non-Episodic Reinforcement Learning

While reinforcement learning (RL) provides a framework for learning thro...
research
11/28/2022

Tackling Visual Control via Multi-View Exploration Maximization

We present MEM: Multi-view Exploration Maximization for tackling complex...
research
05/07/2021

Reward prediction for representation learning and reward shaping

One of the fundamental challenges in reinforcement learning (RL) is the ...
research
04/18/2022

INFOrmation Prioritization through EmPOWERment in Visual Model-Based RL

Model-based reinforcement learning (RL) algorithms designed for handling...
research
10/17/2021

Provable RL with Exogenous Distractors via Multistep Inverse Dynamics

Many real-world applications of reinforcement learning (RL) require the ...
research
07/24/2020

Predictive Information Accelerates Learning in RL

The Predictive Information is the mutual information between the past an...
research
02/26/2018

Disentangling the independently controllable factors of variation by interacting with the world

It has been postulated that a good representation is one that disentangl...
