Reward-Predictive Clustering

11/07/2022
by   Lucas Lehnert, et al.

Recent advances in reinforcement-learning research have demonstrated impressive results in building algorithms that can outperform humans in complex tasks. Nevertheless, creating reinforcement-learning systems that can build abstractions of their experience to accelerate learning in new contexts remains an active area of research. Previous work showed that reward-predictive state abstractions fulfill this goal, but they have only been applied to tabular settings. Here, we provide a clustering algorithm that enables the application of such state abstractions to deep learning settings, providing compressed representations of an agent's inputs that preserve the ability to predict sequences of reward. A convergence theorem and simulations show that the resulting reward-predictive deep network maximally compresses the agent's inputs, significantly speeding up learning in high-dimensional visual control tasks. Furthermore, we present different generalization experiments and analyze under which conditions a pre-trained reward-predictive representation network can be re-used without re-training to accelerate learning, a form of systematic out-of-distribution transfer.
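
To make the notion of a reward-predictive state abstraction concrete, the following is a minimal tabular sketch, not the deep clustering algorithm the abstract refers to. It assumes a small deterministic environment and groups states so that any two states in the same cluster produce the same reward sequence for every action sequence. The function name `reward_predictive_partition`, the `step` callback, and the toy corridor environment are all hypothetical and chosen only for illustration.

```python
def reward_predictive_partition(states, actions, step):
    """Cluster states of a deterministic environment (illustrative sketch).

    `step(state, action)` is an assumed callback returning
    (reward, next_state). States end up in the same cluster only if
    they yield identical reward sequences for all action sequences.
    """
    # Start with one cluster that contains every state.
    cluster = {s: 0 for s in states}
    while True:
        ids = {}
        refined = {}
        for s in states:
            # A state's signature: its current cluster plus, for each
            # action, the one-step reward and the successor's cluster.
            outcomes = []
            for a in actions:
                reward, s_next = step(s, a)
                outcomes.append((reward, cluster[s_next]))
            signature = (cluster[s], tuple(outcomes))
            refined[s] = ids.setdefault(signature, len(ids))
        # Each pass can only split clusters, so an unchanged cluster
        # count means the partition has stopped changing.
        if len(ids) == len(set(cluster.values())):
            return refined
        cluster = refined


# Toy corridor: position 0..2 plus a color bit that never affects reward.
states = [(pos, color) for pos in range(3) for color in range(2)]
actions = [-1, +1]


def step(state, action):
    pos, color = state
    new_pos = min(2, max(0, pos + action))
    reward = 1.0 if new_pos == 2 else 0.0
    return reward, (new_pos, color)


partition = reward_predictive_partition(states, actions, step)
num_clusters = len(set(partition.values()))
```

In this toy case the six input states compress to three clusters, one per corridor position, because the color bit carries no information about future rewards. The deep-learning setting described in the abstract replaces this exact signature matching with learned predictions, which this sketch does not attempt to reproduce.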


research
12/21/2019

Predictive Coding for Boosting Deep Reinforcement Learning with Sparse Rewards

While recent progress in deep reinforcement learning has enabled robots ...
research
05/16/2022

Deep Apprenticeship Learning for Playing Games

In the last decade, deep learning has achieved great success in machine ...
research
02/08/2020

Learning State Abstractions for Transfer in Continuous Control

Can simple algorithms with a good representation solve challenging reinf...
research
07/11/2023

Contextual Pre-Planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning

Recent studies show that deep reinforcement learning (DRL) agents tend t...
research
05/07/2021

Reward prediction for representation learning and reward shaping

One of the fundamental challenges in reinforcement learning (RL) is the ...
research
05/16/2020

Concept Learning in Deep Reinforcement Learning

Deep reinforcement learning techniques have shown to be a promising path...
research
06/18/2021

High-level Features for Resource Economy and Fast Learning in Skill Transfer

Abstraction is an important aspect of intelligence which enables agents ...
