Intrinsic Motivation and Mental Replay enable Efficient Online Adaptation in Stochastic Recurrent Networks

02/22/2018
by   Daniel Tanneberg, et al.
0

Autonomous robots need to interact with unknown, unstructured and changing environments, constantly facing novel challenges. Therefore, continuous online adaptation for lifelong-learning and the need of sample-efficient mechanisms to adapt to changes in the environment, the constraints, the tasks, or the robot itself are crucial. In this work, we propose a novel framework for probabilistic online motion planning with online adaptation based on a bio-inspired stochastic recurrent neural network. By using learning signals which mimic the intrinsic motivation signal cognitive dissonance in addition with a mental replay strategy to intensify experiences, the stochastic recurrent network can learn from few physical interactions and adapts to novel environments in seconds. We evaluate our online planning and adaptation framework on an anthropomorphic KUKA LWR arm. The rapid online adaptation is shown by learning unknown workspace constraints sample-efficiently from few physical interactions while following given via points.

READ FULL TEXT
research
12/08/2018

Efficient transfer learning and online adaptation with latent variable models for continuous control

Traditional model-based RL relies on hand-specified or learned models of...
research
06/12/2018

Organizing Experience: A Deeper Look at Replay Mechanisms for Sample-based Planning in Continuous State Domains

Model-based strategies for control are critical to obtain sample efficie...
research
03/02/2023

STEP: Stochastic Traversability Evaluation and Planning for Risk-Aware Off-road Navigation; Results from the DARPA Subterranean Challenge

Although autonomy has gained widespread usage in structured and controll...
research
09/23/2020

Hierarchical Affordance Discovery using Intrinsic Motivation

To be capable of lifelong learning in a real-life environment, robots ha...
research
09/14/2022

Online Whole-body Motion Planning for Quadrotor using Multi-resolution Search

In this paper, we address the problem of online quadrotor whole-body mot...
research
06/16/2022

On-the-fly Adaptation of Patrolling Strategies in Changing Environments

We consider the problem of efficient patrolling strategy adaptation in a...

Please sign up or login with your details

Forgot password? Click here to reset