Towards Behavior-Level Explanation for Deep Reinforcement Learning

09/17/2020
by   Xuan Chen, et al.
23

While Deep Neural Networks (DNNs) are becoming the state-of-the-art for many tasks including reinforcement learning (RL), they are especially resistant to human scrutiny and understanding. Input attributions have been a foundational building block for DNN expalainabilty but face new challenges when applied to deep RL. We address the challenges with two novel techniques. We define a class of behaviour-level attributions for explaining agent behaviour beyond input importance and interpret existing attribution methods on the behaviour level. We then introduce λ-alignment, a metric for evaluating the performance of behaviour-level attributions methods in terms of whether they are indicative of the agent actions they are meant to explain. Our experiments on Atari games suggest that perturbation-based attribution methods are significantly more suitable to deep RL than alternatives from the perspective of this metric. We argue that our methods demonstrate the minimal set of considerations for adopting general DNN explanation technology to the unique aspects of reinforcement learning and hope the outlined direction can serve as a basis for future research on understanding Deep RL using attribution.

READ FULL TEXT

page 5

page 6

page 10

page 12

page 13

research
05/18/2022

Generating Explanations from Deep Reinforcement Learning Using Episodic Memory

Deep Reinforcement Learning (RL) involves the use of Deep Neural Network...
research
09/04/2023

Leveraging Reward Consistency for Interpretable Feature Discovery in Reinforcement Learning

The black-box nature of deep reinforcement learning (RL) hinders them fr...
research
09/30/2014

An agent-driven semantical identifier using radial basis neural networks and reinforcement learning

Due to the huge availability of documents in digital form, and the decep...
research
04/24/2020

Self-Paced Deep Reinforcement Learning

Generalization and reuse of agent behaviour across a variety of learning...
research
06/05/2016

Deep Q-Networks for Accelerating the Training of Deep Neural Networks

In this paper, we propose a principled deep reinforcement learning (RL) ...
research
10/24/2022

Causal Explanation for Reinforcement Learning: Quantifying State and Temporal Importance

Explainability plays an increasingly important role in machine learning....
research
06/20/2017

Observational Learning by Reinforcement Learning

Observational learning is a type of learning that occurs as a function o...

Please sign up or login with your details

Forgot password? Click here to reset