Guiding Policies with Language via Meta-Learning

11/19/2018
by John D. Co-Reyes, et al.

Behavioral skills or policies for autonomous agents are conventionally learned from reward functions, via reinforcement learning, or from demonstrations, via imitation learning. However, both modes of task specification have disadvantages: reward functions require manual engineering, while demonstrations require a human expert who can actually perform the task in order to generate the demonstration. Following natural language instructions provides an appealing alternative: in the same way that we can specify goals to other humans simply by speaking or writing, we would like to be able to specify tasks for our machines. However, a single instruction may be insufficient to fully communicate our intent or, even if it is sufficient for that, may not be enough for an autonomous agent to actually understand how to perform the desired task. In this work, we propose an interactive formulation of the task specification problem, where iterative language corrections are provided to an autonomous agent, guiding it in acquiring the desired skill. Our proposed language-guided policy learning algorithm can integrate an instruction and a sequence of corrections to acquire new skills very quickly. In our experiments, we show that this method can enable a policy to follow instructions and corrections for simulated navigation and manipulation tasks, substantially outperforming direct, non-interactive instruction following.


research
08/16/2020

Inverse Reinforcement Learning with Natural Language Goals

Humans generally use natural language to communicate task requirements a...
research
10/10/2022

Using Both Demonstrations and Language Instructions to Efficiently Learn Robotic Tasks

Demonstrations and natural language instructions are two common ways to ...
research
12/17/2022

Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning

Physical interactions can often help reveal information that is not read...
research
06/05/2021

Zero-shot Task Adaptation using Natural Language

Imitation learning and instruction-following are two common approaches t...
research
02/28/2022

LISA: Learning Interpretable Skill Abstractions from Language

Learning policies that effectually utilize language instructions in comp...
research
01/24/2023

Language-guided Task Adaptation for Imitation Learning

We introduce a novel setting, wherein an agent needs to learn a task fro...
research
10/21/2019

Self-Educated Language Agent With Hindsight Experience Replay For Instruction Following

Language creates a compact representation of the world and allows the de...
