Integrating Policy Summaries with Reward Decomposition for Explaining Reinforcement Learning Agents

10/21/2022
by   Yael Septon, et al.
0

Explaining the behavior of reinforcement learning agents operating in sequential decision-making settings is challenging, as their behavior is affected by a dynamic environment and delayed rewards. Methods that help users understand the behavior of such agents can roughly be divided into local explanations that analyze specific decisions of the agents and global explanations that convey the general strategy of the agents. In this work, we study a novel combination of local and global explanations for reinforcement learning agents. Specifically, we combine reward decomposition, a local explanation method that exposes which components of the reward function influenced a specific decision, and HIGHLIGHTS, a global explanation method that shows a summary of the agent's behavior in decisive states. We conducted two user studies to evaluate the integration of these explanation methods and their respective benefits. Our results show significant benefits for both methods. In general, we found that the local reward decomposition was more useful for identifying the agents' priorities. However, when there was only a minor difference between the agents' preferences, then the global information provided by HIGHLIGHTS additionally improved participants' understanding.

READ FULL TEXT

page 4

page 7

page 9

page 10

research
05/18/2020

Local and Global Explanations of Agent Behavior: Integrating Strategy Summaries with Saliency Maps

With advances in reinforcement learning (RL), agents are now being devel...
research
04/25/2023

A Closer Look at Reward Decomposition for High-Level Robotic Explanations

Explaining the behavior of intelligent agents such as robots to humans i...
research
01/06/2021

One-shot Policy Elicitation via Semantic Reward Manipulation

Synchronizing expectations and knowledge about the state of the world is...
research
03/17/2019

Model-Free Model Reconciliation

Designing agents capable of explaining complex sequential decisions rema...
research
10/10/2022

Experiential Explanations for Reinforcement Learning

Reinforcement Learning (RL) approaches are becoming increasingly popular...
research
09/27/2021

Exploring The Role of Local and Global Explanations in Recommender Systems

Explanations are well-known to improve recommender systems' transparency...
research
03/22/2019

Explaining Reinforcement Learning to Mere Mortals: An Empirical Study

We present a user study to investigate the impact of explanations on non...

Please sign up or login with your details

Forgot password? Click here to reset