Learning to Solve Voxel Building Embodied Tasks from Pixels and Natural Language Instructions

11/01/2022
by   Alexey Skrynnik, et al.
0

The adoption of pre-trained language models to generate action plans for embodied agents is a promising research strategy. However, execution of instructions in real or simulated environments requires verification of the feasibility of actions as well as their relevance to the completion of a goal. We propose a new method that combines a language model and reinforcement learning for the task of building objects in a Minecraft-like environment according to the natural language instructions. Our method first generates a set of consistently achievable sub-goals from the instructions and then completes associated sub-tasks with a pre-trained RL policy. The proposed method formed the RL baseline at the IGLU 2022 competition.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/08/2023

Temporal Video-Language Alignment Network for Reward Shaping in Reinforcement Learning

Designing appropriate reward functions for Reinforcement Learning (RL) a...
research
07/15/2023

Can Pre-Trained Text-to-Image Models Generate Visual Goals for Reinforcement Learning?

Pre-trained text-to-image generative models can produce diverse, semanti...
research
05/19/2020

Human Instruction-Following with Deep Reinforcement Learning via Transfer-Learning from Text

Recent work has described neural-network-based agents that are trained w...
research
07/24/2023

A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis

Pre-trained large language models (LLMs) have recently achieved better g...
research
03/20/2019

Prospection: Interpretable Plans From Language By Predicting the Future

High-level human instructions often correspond to behaviors with multipl...
research
09/06/2023

Reinforcement Learning of Action and Query Policies with LTL Instructions under Uncertain Event Detector

Reinforcement learning (RL) with linear temporal logic (LTL) objectives ...
research
06/16/2018

Scheduled Policy Optimization for Natural Language Communication with Intelligent Agents

We investigate the task of learning to follow natural language instructi...

Please sign up or login with your details

Forgot password? Click here to reset