Learning Design and Construction with Varying-Sized Materials via Prioritized Memory Resets

04/12/2022
by   Yunfei Li, et al.
0

Can a robot autonomously learn to design and construct a bridge from varying-sized blocks without a blueprint? It is a challenging task with long horizon and sparse reward – the robot has to figure out physically stable design schemes and feasible actions to manipulate and transport blocks. Due to diverse block sizes, the state space and action trajectories are vast to explore. In this paper, we propose a hierarchical approach for this problem. It consists of a reinforcement-learning designer to propose high-level building instructions and a motion-planning-based action generator to manipulate blocks at the low level. For high-level learning, we develop a novel technique, prioritized memory resetting (PMR) to improve exploration. PMR adaptively resets the state to those most critical configurations from a replay buffer so that the robot can resume training on partial architectures instead of from scratch. Furthermore, we augment PMR with auxiliary training objectives and fine-tune the designer with the locomotion generator. Our experiments in simulation and on a real deployed robotic system demonstrate that it is able to effectively construct bridges with blocks of varying sizes at a high success rate. Demos can be found at https://sites.google.com/view/bridge-pmr.

READ FULL TEXT

page 1

page 5

page 6

research
08/05/2021

Learning to Design and Construct Bridge without Blueprint

Autonomous assembly has been a desired functionality of many intelligent...
research
05/23/2019

From semantics to execution: Integrating action planning with reinforcement learning for robotic tool use

Reinforcement learning is an appropriate and successful method to robust...
research
03/08/2022

Graph-based Reinforcement Learning meets Mixed Integer Programs: An application to 3D robot assembly discovery

Robot assembly discovery is a challenging problem that lives at the inte...
research
02/01/2022

RFUniverse: A Physics-based Action-centric Interactive Environment for Everyday Household Tasks

Household environments are important testbeds for embodied AI research. ...
research
03/10/2022

Learn2Assemble with Structured Representations and Search for Robotic Architectural Construction

Autonomous robotic assembly requires a well-orchestrated sequence of hig...
research
10/02/2018

Time Reversal as Self-Supervision

A longstanding challenge in robot learning for manipulation tasks has be...

Please sign up or login with your details

Forgot password? Click here to reset