Learning Two-Step Hybrid Policy for Graph-Based Interpretable Reinforcement Learning

01/21/2022
by Tongzhou Mu, et al.

We present a two-step hybrid reinforcement learning (RL) policy designed to produce interpretable and robust hierarchical policies for RL problems with graph-based input. Unlike prior deep RL policies, which are parameterized by an end-to-end black-box graph neural network, our approach splits the decision-making process into two steps. The first step is a simplified classification problem that maps the graph input to an action group whose actions all share a similar semantic meaning. The second step applies a rule miner that performs explicit one-hop reasoning over the graph and identifies the decisive edges in the graph input, without requiring heavy domain knowledge. The resulting two-step hybrid policy yields human-friendly interpretations and achieves better generalization and robustness. Extensive experiments on four levels of complex text-based games demonstrate that the proposed method outperforms state-of-the-art approaches.
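The two-step decomposition described above can be illustrated with a minimal Python sketch. This is not the authors' implementation: the graph encoding as (head, relation, tail) triples, the hand-written group scorers, the rule table, and all names (`select_action_group`, `select_action`, `GROUP_SCORERS`, `RULES`) are hypothetical stand-ins chosen for illustration. In the paper, the first step is a learned classifier and the rules are mined rather than written by hand.

```python
from typing import Callable, Dict, List, Optional, Set, Tuple

# Assumption: the graph input is a set of (head, relation, tail) triples.
Edge = Tuple[str, str, str]
Graph = Set[Edge]

# Step 1 stand-in: one hand-written scorer per action group.
# (Hypothetical; a learned classifier would play this role.)
GROUP_SCORERS: Dict[str, Callable[[Graph], float]] = {
    "take": lambda g: sum(1 for (_, r, _) in g if r == "in"),
    "cut": lambda g: 2 * sum(1 for (_, _, t) in g if t == "uncut"),
}

# Step 2 stand-in: per group, (action, decisive edge) rules.
# (Hypothetical; mined rules would play this role.)
RULES: Dict[str, List[Tuple[str, Edge]]] = {
    "take": [("take knife", ("knife", "in", "kitchen"))],
    "cut": [
        ("cut potato", ("potato", "is", "uncut")),
        ("cut carrot", ("carrot", "is", "uncut")),
    ],
}


def select_action_group(graph: Graph) -> str:
    """Step 1: map the whole graph to the highest-scoring action group."""
    return max(GROUP_SCORERS, key=lambda name: GROUP_SCORERS[name](graph))


def select_action(graph: Graph, group: str) -> Optional[Tuple[str, Edge]]:
    """Step 2: one-hop check; return the first action whose decisive edge
    is present in the graph, together with that edge as the explanation."""
    for action, decisive_edge in RULES[group]:
        if decisive_edge in graph:
            return action, decisive_edge
    return None


# Toy observation from a text-based cooking game (made-up data).
graph: Graph = {
    ("player", "at", "kitchen"),
    ("knife", "in", "kitchen"),
    ("carrot", "is", "uncut"),
}

group = select_action_group(graph)
result = select_action(graph, group)
action, evidence = result if result is not None else (None, None)
print(group, action, evidence)
```

Because the second step returns the matched edge alongside the action, each decision comes with a single-edge justification. This is the sense in which a one-hop rule is easier to read than the output of an end-to-end network.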


