Contrastive Explanations for Reinforcement Learning via Embedded Self Predictions

10/11/2020
by   Zhengxian Lin, et al.
0

We investigate a deep reinforcement learning (RL) architecture that supports explaining why a learned agent prefers one action over another. The key idea is to learn action-values that are directly represented via human-understandable properties of expected futures. This is realized via the embedded self-prediction (ESP)model, which learns said properties in terms of human provided features. Action preferences can then be explained by contrasting the future properties predicted for each action. To address cases where there are a large number of features, we develop a novel method for computing minimal sufficient explanations from anESP. Our case studies in three domains, including a complex strategy game, show that ESP models can be effectively learned and support insightful explanations.

READ FULL TEXT

page 7

page 8

page 18

page 20

research
11/14/2022

(When) Are Contrastive Explanations of Reinforcement Learning Helpful?

Global explanations of a reinforcement learning (RL) agent's expected be...
research
07/23/2018

Contrastive Explanations for Reinforcement Learning in terms of Expected Consequences

Machine Learning models become increasingly proficient in complex tasks....
research
06/09/2023

Explaining Reinforcement Learning with Shapley Values

For reinforcement learning systems to be widely adopted, their users mus...
research
05/14/2021

Feature-Based Interpretable Reinforcement Learning based on State-Transition Models

Growing concerns regarding the operational usage of AI models in the rea...
research
11/10/2020

What Did You Think Would Happen? Explaining Agent Behaviour Through Intended Outcomes

We present a novel form of explanation for Reinforcement Learning, based...
research
05/28/2019

Generation of Policy-Level Explanations for Reinforcement Learning

Though reinforcement learning has greatly benefited from the incorporati...
research
11/25/2017

Malaria Likelihood Prediction By Effectively Surveying Households Using Deep Reinforcement Learning

We build a deep reinforcement learning (RL) agent that can predict the l...

Please sign up or login with your details

Forgot password? Click here to reset