From proprioception to long-horizon planning in novel environments: A hierarchical RL model

06/11/2020
by Nishad Gothoskar, et al.

For an intelligent agent to operate flexibly and efficiently in complex environments, it must be able to reason at multiple levels of temporal, spatial, and conceptual abstraction. At the lower levels, the agent must interpret its proprioceptive inputs and control its muscles; at the higher levels, it must select goals and plan how to achieve them. Each of these types of reasoning is suited to different representations, algorithms, and inputs. In this work, we introduce a simple, three-level hierarchical architecture that reflects these distinctions. The low-level controller operates on the continuous proprioceptive inputs, using model-free learning to acquire useful behaviors. These behaviors in turn induce a set of mid-level dynamics, which the mid-level controller learns and uses for model-predictive control to select a behavior to activate at each timestep. The high-level controller uses a discrete graph representation for goal selection and path planning, specifying targets for the mid-level controller. We apply our method to a series of navigation tasks in the MuJoCo Ant environment, consistently demonstrating significant improvements in sample efficiency compared to prior model-free, model-based, and hierarchical RL methods. Finally, as an illustrative example of the advantages of our architecture, we apply our method to a complex maze environment that requires efficient exploration and long-horizon planning.
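The division of labor between the three levels can be illustrated with a toy sketch. This is not the authors' implementation: the 2-D grid agent, the four hand-written behaviors, the exact dynamics model, and every name below (`BEHAVIORS`, `low_level_step`, `mid_level_model`, `mpc_select`, `plan_path`, `run`, `GRAPH`) are assumptions made for illustration. In the paper the low-level behaviors are learned model-free from proprioception and the mid-level dynamics are learned from data; here both are replaced by fixed displacements so the control flow is easy to follow.

```python
from collections import deque

# Hypothetical toy setup: a 2-D grid agent stands in for the Ant.
# Each behavior is summarized by the displacement it is assumed to produce.
BEHAVIORS = {"north": (0, 1), "south": (0, -1), "east": (1, 0), "west": (-1, 0)}


def low_level_step(state, behavior):
    """Stand-in for a learned low-level policy: execute one behavior."""
    dx, dy = BEHAVIORS[behavior]
    return (state[0] + dx, state[1] + dy)


def mid_level_model(state, behavior):
    """Stand-in for learned mid-level dynamics (assumed exact here)."""
    dx, dy = BEHAVIORS[behavior]
    return (state[0] + dx, state[1] + dy)


def mpc_select(state, target, horizon=2):
    """Model-predictive control: return the first behavior of the
    lowest-cost rollout of length `horizon` under the mid-level model."""
    def cost(s):
        return abs(s[0] - target[0]) + abs(s[1] - target[1])

    def rollout_cost(s, depth):
        # Cumulative cost: distance now plus the best achievable afterwards.
        if depth == 0:
            return cost(s)
        return cost(s) + min(
            rollout_cost(mid_level_model(s, b), depth - 1) for b in BEHAVIORS
        )

    return min(
        BEHAVIORS,
        key=lambda b: rollout_cost(mid_level_model(state, b), horizon - 1),
    )


def plan_path(graph, start, goal):
    """High-level planner: breadth-first search over a discrete graph."""
    parents = {start: None}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node == goal:
            path = []
            while node is not None:
                path.append(node)
                node = parents[node]
            return path[::-1]
        for neighbor in graph[node]:
            if neighbor not in parents:
                parents[neighbor] = node
                queue.append(neighbor)
    return None  # goal unreachable


def run(graph, start, goal, max_steps=50):
    """Chain the levels: graph waypoints -> MPC behavior choice -> execution."""
    state, trajectory = start, [start]
    path = plan_path(graph, start, goal)
    if path is None:
        return trajectory
    for waypoint in path[1:]:
        while state != waypoint and len(trajectory) <= max_steps:
            behavior = mpc_select(state, waypoint)
            state = low_level_step(state, behavior)
            trajectory.append(state)
    return trajectory


# A four-cell maze with a wall between (0, 0) and (1, 0).
GRAPH = {
    (0, 0): [(0, 1)],
    (0, 1): [(0, 0), (1, 1)],
    (1, 1): [(0, 1), (1, 0)],
    (1, 0): [(1, 1)],
}
trajectory = run(GRAPH, (0, 0), (1, 0))
```

In this sketch only the high-level graph knows about the wall, so the mid-level controller never needs to reason about obstacles: it just tracks the next waypoint. That mirrors the separation described above, where long-horizon structure lives in the discrete graph and short-horizon control lives in the behavior-level model.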

