Accelerating Learning in Constructive Predictive Frameworks with the Successor Representation

03/23/2018
by   Craig Sherstan, et al.
0

Here we propose using the successor representation (SR) to accelerate learning in a constructive knowledge system based on general value functions (GVFs). In real-world settings like robotics for unstructured and dynamic environments, it is infeasible to model all meaningful aspects of a system and its environment by hand due to both complexity and size. Instead, robots must be capable of learning and adapting to changes in their environment and task, incrementally constructing models from their own experience. GVFs, taken from the field of reinforcement learning (RL), are a way of modeling the world as predictive questions. One approach to such models proposes a massive network of interconnected and interdependent GVFs, which are incrementally added over time. It is reasonable to expect that new, incrementally added predictions can be learned more swiftly if the learning process leverages knowledge gained from past experience. The SR provides such a means of separating the dynamics of the world from the prediction targets and thus capturing regularities that can be reused across multiple GVFs. As a primary contribution of this work, we show that using SR-based predictions can improve sample efficiency and learning speed in a continual learning setting where new predictions are incrementally added and learned over time. We analyze our approach in a grid-world and then demonstrate its potential on data from a physical robot arm.

READ FULL TEXT
research
06/29/2019

Continual Learning for Robotics

Continual learning (CL) is a particular machine learning paradigm where ...
research
10/09/2018

Continual State Representation Learning for Reinforcement Learning using Generative Replay

We consider the problem of building a state representation model in a co...
research
03/06/2019

Using World Models for Pseudo-Rehearsal in Continual Learning

The utility of learning a dynamics/world model of the environment in rei...
research
01/23/2020

What's a Good Prediction? Issues in Evaluating General Value Functions Through Error

Constructing and maintaining knowledge of the world is a central problem...
research
01/12/2023

Predictive World Models from Real-World Partial Observations

Cognitive scientists believe adaptable intelligent agents like humans pe...
research
11/03/2021

The effect of synaptic weight initialization in feature-based successor representation learning

After discovering place cells, the idea of the hippocampal (HPC) functio...
research
04/18/2019

Making Meaning: Semiotics Within Predictive Knowledge Architectures

Within Reinforcement Learning, there is a fledgling approach to conceptu...

Please sign up or login with your details

Forgot password? Click here to reset