Composing Task Knowledge with Modular Successor Feature Approximators

01/28/2023
by   Wilka Carvalho, et al.
0

Recently, the Successor Features and Generalized Policy Improvement (SF GPI) framework has been proposed as a method for learning, composing, and transferring predictive knowledge and behavior. SF GPI works by having an agent learn predictive representations (SFs) that can be combined for transfer to new tasks with GPI. However, to be effective this approach requires state features that are useful to predict, and these state-features are typically hand-designed. In this work, we present a novel neural network architecture, "Modular Successor Feature Approximators" (MSFA), where modules both discover what is useful to predict, and learn their own predictive representations. We show that MSFA is able to better generalize compared to baseline architectures for learning SFs and modular architectures

READ FULL TEXT

page 7

page 17

page 18

research
06/26/2018

Modular meta-learning

Many prediction problems, such as those that arise in the context of rob...
research
06/22/2020

What shapes feature representations? Exploring datasets, architectures, and training

In naturalistic learning problems, a model's input contains a wide range...
research
08/14/2010

Discover & eXplore Neural Network (DXNN) Platform, a Modular TWEANN

In this paper I present a novel type of Topology and Weight Evolving Art...
research
06/02/2023

Independent Modular Networks

Monolithic neural networks that make use of a single set of weights to l...
research
12/21/2021

Provable Hierarchical Lifelong Learning with a Sketch-based Modular Architecture

We propose a modular architecture for the lifelong learning of hierarchi...
research
01/23/2020

What's a Good Prediction? Issues in Evaluating General Value Functions Through Error

Constructing and maintaining knowledge of the world is a central problem...
research
04/13/2012

Compensating Interpolation Distortion by Using New Optimized Modular Method

A modular method was suggested before to recover a band limited signal f...

Please sign up or login with your details

Forgot password? Click here to reset