Explaining Reinforcement Learning Policies through Counterfactual Trajectories

01/29/2022
by Julius Frost, et al.

Before RL agents can be deployed on real-world tasks with confidence, a human developer must validate that the agent will perform well at test time. Some policy interpretability methods support this by capturing the policy's decision-making in a set of agent rollouts. However, even the most informative trajectories of training-time behavior may give little insight into the agent's behavior out of distribution. In contrast, our method conveys how the agent performs under distribution shift by showing its behavior across a wider trajectory distribution. We generate these trajectories by guiding the agent to more diverse, unseen states and showing the agent's behavior there. In a user study, we demonstrate that our method enables users to score better than baseline methods on one of two agent validation tasks.
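The abstract does not spell out how the guiding is done, so the following is only a minimal sketch of the general idea: a guide steers the agent into states it might not otherwise visit, then control returns to the agent and its behavior from that state is recorded. The corridor environment, `agent_policy`, `guide_policy`, and `counterfactual_trajectory` are all hypothetical names invented for illustration, and the random guide is a stand-in, not the method from the paper.

```python
import random

# Toy 1-D corridor: states 0..N_STATES-1, goal at the right end.
# Everything here is a hypothetical stand-in, not the paper's setup.
N_STATES = 10
GOAL = N_STATES - 1

def step(state, action):
    """Move left (-1) or right (+1), clipped to the corridor."""
    return max(0, min(GOAL, state + action))

def agent_policy(state):
    """Stand-in for the trained policy under inspection: always head right."""
    return 1

def guide_policy(state, rng):
    """Stand-in guide: random moves that push the agent to varied states."""
    return rng.choice([-1, 1])

def counterfactual_trajectory(start, guide_steps, max_agent_steps, rng):
    """Guide for `guide_steps` steps, then hand control back to the agent."""
    state = start
    guided = [state]
    for _ in range(guide_steps):
        state = step(state, guide_policy(state, rng))
        guided.append(state)
    # The agent resumes from wherever the guide left it.
    agent = [state]
    for _ in range(max_agent_steps):
        if state == GOAL:
            break
        state = step(state, agent_policy(state))
        agent.append(state)
    return guided, agent

rng = random.Random(0)
rollouts = [counterfactual_trajectory(5, 4, 20, rng) for _ in range(5)]
for guided, agent in rollouts:
    print("guided:", guided, "-> agent:", agent)
```

Each rollout pairs a guided prefix with the agent's own continuation, so a reviewer can see how the policy behaves from states it would rarely reach on its own.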


