Diversity-Driven Extensible Hierarchical Reinforcement Learning

11/10/2018
by   Yuhang Song, et al.
0

Hierarchical reinforcement learning (HRL) has recently shown promising advances on speeding up learning, improving the exploration, and discovering intertask transferable skills. Most recent works focus on HRL with two levels, i.e., a master policy manipulates subpolicies, which in turn manipulate primitive actions. However, HRL with multiple levels is usually needed in many real-world scenarios, whose ultimate goals are highly abstract, while their actions are very primitive. Therefore, in this paper, we propose a diversity-driven extensible HRL (DEHRL), where an extensible and scalable framework is built and learned levelwise to realize HRL with multiple levels. DEHRL follows a popular assumption: diverse subpolicies are useful, i.e., subpolicies are believed to be more useful if they are more diverse. However, existing implementations of this diversity assumption usually have their own drawbacks, which makes them inapplicable to HRL with multiple levels. Consequently, we further propose a novel diversity-driven solution to achieve this assumption in DEHRL. Experimental studies evaluate DEHRL with five baselines from four perspectives in two domains; the results show that DEHRL outperforms the state-of-the-art baselines in all four aspects.

READ FULL TEXT

page 2

page 4

page 5

page 6

page 7

research
07/12/2022

DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization

Recent algorithms designed for reinforcement learning tasks focus on fin...
research
10/18/2022

Online Damage Recovery for Physical Robots with Hierarchical Quality-Diversity

In real-world environments, robots need to be resilient to damages and r...
research
08/23/2023

Diverse Policies Converge in Reward-free Markov Decision Processe

Reinforcement learning has achieved great success in many decision-makin...
research
11/11/2022

Emergency action termination for immediate reaction in hierarchical reinforcement learning

Hierarchical decomposition of control is unavoidable in large dynamical ...
research
06/06/2021

DisTop: Discovering a Topological representation to learn diverse and rewarding skills

The optimal way for a deep reinforcement learning (DRL) agent to explore...
research
06/21/2019

Continual Reinforcement Learning with Diversity Exploration and Adversarial Self-Correction

Deep reinforcement learning has made significant progress in the field o...
research
03/20/2019

Reinforcing Classical Planning for Adversary Driving Scenarios

Adversary scenarios in driving, where the other vehicles may make mistak...

Please sign up or login with your details

Forgot password? Click here to reset