Acquiring Target Stacking Skills by Goal-Parameterized Deep Reinforcement Learning

11/01/2017
by   Wenbin Li, et al.
0

Understanding physical phenomena is a key component of human intelligence and enables physical interaction with previously unseen environments. In this paper, we study how an artificial agent can autonomously acquire this intuition through interaction with the environment. We created a synthetic block stacking environment with physics simulation in which the agent can learn a policy end-to-end through trial and error. Thereby, we bypass to explicitly model physical knowledge within the policy. We are specifically interested in tasks that require the agent to reach a given goal state that may be different for every new trial. To this end, we propose a deep reinforcement learning framework that learns policies which are parametrized by a goal. We validated the model on a toy example navigating in a grid world with different target positions and in a block stacking task with different target structures of the final tower. In contrast to prior work, our policies show better generalization across different goals.

READ FULL TEXT
research
04/19/2019

Learning Manipulation under Physics Constraints with Visual Perception

Understanding physical phenomena is a key competence that enables humans...
research
10/23/2019

Learning Deep Parameterized Skills from Demonstration for Re-targetable Visuomotor Control

Robots need to learn skills that can not only generalize across similar ...
research
05/13/2019

Task-Agnostic Dynamics Priors for Deep Reinforcement Learning

While model-based deep reinforcement learning (RL) holds great promise f...
research
06/17/2018

Learning Policy Representations in Multiagent Systems

Modeling agent behavior is central to understanding the emergence of com...
research
11/27/2020

Autonomous learning of multiple, context-dependent tasks

When facing the problem of autonomously learning multiple tasks with rei...
research
03/03/2016

Learning Physical Intuition of Block Towers by Example

Wooden blocks are a common toy for infants, allowing them to develop mot...
research
10/25/2021

Unsupervised Domain Adaptation with Dynamics-Aware Rewards in Reinforcement Learning

Unsupervised reinforcement learning aims to acquire skills without prior...

Please sign up or login with your details

Forgot password? Click here to reset