Counterfactual States for Atari Agents via Generative Deep Learning

09/27/2019
by Matthew L. Olson, et al.

Although deep reinforcement learning agents have produced impressive results in many domains, their decision making is difficult to explain to humans. To address this problem, past work has mainly focused on explaining why an action was chosen in a given state. Another useful type of explanation is a counterfactual, which deals with "what if?" scenarios. In this work, we introduce the concept of a counterfactual state, which helps humans understand what would minimally need to change in an Atari game image for the agent to choose a different action. We introduce a novel method for creating counterfactual states using a generative deep learning architecture. In addition, we evaluate the effectiveness of counterfactual states with human participants who are not machine learning experts. Our user study results suggest that the generated counterfactual states help non-expert participants better understand an agent's decision-making process.


