Latent Representation Prediction Networks

09/20/2020
by   Hlynur Davíð Hlynsson, et al.
0

Deeply-learned planning methods are often based on learning representations that are optimized for unrelated tasks. For example, they might be trained on reconstructing the environment. These representations are then combined with predictor functions for simulating rollouts to navigate the environment. We find this principle of learning representations unsatisfying and propose to learn them such that they are directly optimized for the task at hand: to be maximally predictable for the predictor function. This results in representations that are by design optimal for the downstream task of planning, where the learned predictor function is used as a forward model. To this end, we propose a new way of jointly learning this representation along with the prediction function, a system we dub Latent Representation Prediction Network (LARP). The prediction function is used as a forward model for search on a graph in a viewpoint-matching task and the representation learned to maximize predictability is found to outperform a pre-trained representation. Our approach is shown to be more sample-efficient than standard reinforcement learning methods and our learned representation transfers successfully to dissimilar objects.

READ FULL TEXT

page 12

page 13

page 14

research
09/12/2018

Combined Reinforcement Learning via Abstract Representations

In the quest for efficient and robust reinforcement learning methods, bo...
research
07/06/2023

When No-Rejection Learning is Optimal for Regression with Rejection

Learning with rejection is a prototypical model for studying the interac...
research
09/15/2020

Evaluating representations by the complexity of learning low-loss predictors

We consider the problem of evaluating representations of data for use in...
research
05/03/2021

Curious Representation Learning for Embodied Intelligence

Self-supervised representation learning has achieved remarkable success ...
research
12/12/2009

Closing the Learning-Planning Loop with Predictive State Representations

A central problem in artificial intelligence is that of planning to maxi...
research
02/26/2019

Functional Transparency for Structured Data: a Game-Theoretic Approach

We provide a new approach to training neural models to exhibit transpare...
research
09/28/2000

A Classification Approach to Word Prediction

The eventual goal of a language model is to accurately predict the value...

Please sign up or login with your details

Forgot password? Click here to reset