Deep compositional robotic planners that follow natural language commands

02/12/2020
by   Yen-Ling Kuo, et al.
0

We demonstrate how a sampling-based robotic planner can be augmented to learn to understand a sequence of natural language commands in a continuous configuration space to move and manipulate objects. Our approach combines a deep network structured according to the parse of a complex command that includes objects, verbs, spatial relations, and attributes, with a sampling-based planner, RRT. A recurrent hierarchical deep network controls how the planner explores the environment, determines when a planned path is likely to achieve a goal, and estimates the confidence of each move to trade off exploitation and exploration between the network and the planner. Planners are designed to have near-optimal behavior when information about the task is missing, while networks learn to exploit observations which are available from the environment, making the two naturally complementary. Combining the two enables generalization to new maps, new kinds of obstacles, and more complex sentences that do not occur in the training set. Little data is required to train the model despite it jointly acquiring a CNN that extracts features from the environment as it learns the meanings of words. The model provides a level of interpretability through the use of attention maps allowing users to see its reasoning steps despite being an end-to-end model. This end-to-end model allows robots to learn to follow natural language commands in challenging continuous environments.

READ FULL TEXT

page 1

page 6

research
07/28/2023

RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control

We study how vision-language models trained on Internet-scale data can b...
research
10/01/2018

Deep sequential models for sampling-based planning

We demonstrate how a sequence model and a sampling-based planner can inf...
research
05/24/2020

Learning visual servo policies via planner cloning

Learning control policies for visual servoing in novel environments is a...
research
09/12/2018

Safe Navigation with Human Instructions in Complex Scenes

In this paper, we present a robotic navigation algorithm with natural la...
research
03/02/2023

Reshaping Viscoelastic-String Path-Planner (RVP)

We present Reshaping Viscoelastic-String Path-Planner a Path Planner tha...
research
07/06/2022

Compositional Generalization in Grounded Language Learning via Induced Model Sparsity

We provide a study of how induced model sparsity can help achieve compos...
research
12/03/2018

Mitigating Planner Overfitting in Model-Based Reinforcement Learning

An agent with an inaccurate model of its environment faces a difficult c...

Please sign up or login with your details

Forgot password? Click here to reset