Interactive Hierarchical Guidance using Language

10/09/2021
by Bharat Prakash, et al.

Reinforcement learning has been successful in many tasks, ranging from robotic control and games to energy management. In complex real-world environments with sparse rewards and long task horizons, however, sample efficiency remains a major challenge. Many complex tasks decompose naturally into high-level planning and low-level control, so it is important to enable agents to exploit this hierarchical structure and break larger tasks into multiple smaller sub-tasks. We introduce an approach that uses language to specify sub-tasks: a high-level planner issues language commands to a low-level controller, which executes the corresponding sub-tasks. Our experiments show that this method can solve complex long-horizon planning tasks with limited human supervision. Using language has the added benefits of interpretability and of allowing expert humans to take over the high-level planning and provide language commands when necessary.
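The planner/controller split described above can be illustrated with a small toy sketch. This is not the paper's implementation: the corridor environment, the command strings, and every name below (`State`, `high_level_planner`, `low_level_controller`, `run_episode`) are hypothetical, and both components are hand-written rules standing in for whatever policies the authors actually train. The only point is the interface, in which the planner emits a language command and the controller turns it into primitive actions.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple


@dataclass
class State:
    """Hypothetical 1-D corridor: fetch a key, then open a door."""
    pos: int = 0
    key_pos: int = 3
    door_pos: int = 6
    has_key: bool = False
    door_open: bool = False


def high_level_planner(s: State) -> Optional[str]:
    """Return the next language command, or None once the task is solved."""
    if s.door_open:
        return None
    if not s.has_key:
        return "go to the key" if s.pos != s.key_pos else "pick up the key"
    return "go to the door" if s.pos != s.door_pos else "open the door"


def low_level_controller(s: State, command: str) -> int:
    """Execute one sub-task with primitive actions; return the steps used."""
    steps = 0
    if command.startswith("go to the "):
        target = s.key_pos if command.endswith("key") else s.door_pos
        while s.pos != target:
            s.pos += 1 if target > s.pos else -1  # one primitive move
            steps += 1
    elif command == "pick up the key":
        s.has_key = True
        steps = 1
    elif command == "open the door":
        s.door_open = True
        steps = 1
    else:
        raise ValueError(f"unknown command: {command}")
    return steps


def run_episode(
    planner: Callable[[State], Optional[str]] = high_level_planner,
    max_commands: int = 10,
) -> Tuple[State, List[str]]:
    """Alternate planning and control until the planner signals completion."""
    s = State()
    trace: List[str] = []
    for _ in range(max_commands):
        command = planner(s)
        if command is None:
            break
        low_level_controller(s, command)
        trace.append(command)  # the trace is a human-readable plan
    return s, trace


if __name__ == "__main__":
    final_state, trace = run_episode()
    print(trace, final_state.door_open)
```

Because `run_episode` takes the planner as an argument, a human expert could in principle be substituted by passing a function that returns typed commands instead of `high_level_planner`, which mirrors the takeover scenario mentioned in the abstract. The recorded `trace` of commands is what makes the high-level behaviour inspectable.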


