Reinforcement Learning for Improving Agent Design

10/09/2018
by   David Ha, et al.
10

In many reinforcement learning tasks, the goal is to learn a policy to manipulate an agent, whose design is fixed, to maximize some notion of cumulative reward. The design of the agent's physical structure is rarely optimized for the task at hand. In this work, we explore the possibility of learning a version of the agent's design that is better suited for its task, jointly with the policy. We propose a minor alteration to the OpenAI Gym framework, where we parameterize parts of an environment, and allow an agent to jointly learn to modify these environment parameters along with its policy. We demonstrate that an agent can learn a better structure of its body that is not only better suited for the task, but also facilitates policy learning. Joint learning of policy and structure may even uncover design principles that are useful for assisted-design applications. Videos of results at https://designrl.github.io/

READ FULL TEXT

page 1

page 3

page 4

page 5

research
04/04/2021

Influencing Reinforcement Learning through Natural Language Guidance

Interactive reinforcement learning agents use human feedback or instruct...
research
03/27/2018

World Models

We explore building generative neural network models of popular reinforc...
research
10/27/2022

Meta-Reinforcement Learning Using Model Parameters

In meta-reinforcement learning, an agent is trained in multiple differen...
research
06/26/2022

Learning to Rearrange with Physics-Inspired Risk Awareness

Real-world applications require a robot operating in the physical world ...
research
08/09/2017

Decoupled Learning of Environment Characteristics for Safe Exploration

Reinforcement learning is a proven technique for an agent to learn a tas...
research
12/05/2019

Iterative Policy-Space Expansion in Reinforcement Learning

Humans and animals solve a difficult problem much more easily when they ...
research
11/24/2020

Time Limits in Reinforcement Learning

In reinforcement learning, it is common to let an agent interact for a f...

Please sign up or login with your details

Forgot password? Click here to reset