Information Theoretically Aided Reinforcement Learning for Embodied Agents

05/31/2016
by   Guido Montufar, et al.
0

Reinforcement learning for embodied agents is a challenging problem. The accumulated reward to be optimized is often a very rugged function, and gradient methods are impaired by many local optimizers. We demonstrate, in an experimental setting, that incorporating an intrinsic reward can smoothen the optimization landscape while preserving the global optimizers of interest. We show that policy gradient optimization for locomotion in a complex morphology is significantly improved when supplementing the extrinsic reward by an intrinsic reward defined in terms of the mutual information of time consecutive sensor readings.

READ FULL TEXT

page 5

page 9

page 13

page 15

page 16

page 17

research
10/12/2019

Influence-Based Multi-Agent Exploration

Intrinsically motivated reinforcement learning aims to address the explo...
research
07/07/2017

Emergence of Locomotion Behaviours in Rich Environments

The reinforcement learning paradigm allows, in principle, for complex be...
research
05/14/2022

Cliff Diving: Exploring Reward Surfaces in Reinforcement Learning Environments

Visualizing optimization landscapes has led to many fundamental insights...
research
06/18/2018

Learning from Outside the Viability Kernel: Why we Should Build Robots that can Fall with Grace

Despite impressive results using reinforcement learning to solve complex...
research
12/18/2017

ES Is More Than Just a Traditional Finite-Difference Approximator

An evolution strategy (ES) variant recently attracted significant attent...
research
09/26/2019

High-Dimensional Control Using Generalized Auxiliary Tasks

A long-standing challenge in reinforcement learning is the design of fun...
research
05/05/2022

Chemoreception and chemotaxis of a three-sphere swimmer

The coupled problem of hydrodynamics and solute transport for the Najafi...

Please sign up or login with your details

Forgot password? Click here to reset