(When) Are Contrastive Explanations of Reinforcement Learning Helpful?

11/14/2022
by   Sanjana Narayanan, et al.
0

Global explanations of a reinforcement learning (RL) agent's expected behavior can make it safer to deploy. However, such explanations are often difficult to understand because of the complicated nature of many RL policies. Effective human explanations are often contrastive, referencing a known contrast (policy) to reduce redundancy. At the same time, these explanations also require the additional effort of referencing that contrast when evaluating an explanation. We conduct a user study to understand whether and when contrastive explanations might be preferable to complete explanations that do not require referencing a contrast. We find that complete explanations are generally more effective when they are the same size or smaller than a contrastive explanation of the same policy, and no worse when they are larger. This suggests that contrastive explanations are not sufficient to solve the problem of effectively explaining reinforcement learning policies, and require additional careful study for use in this context.

READ FULL TEXT
research
07/13/2022

Policy Optimization with Sparse Global Contrastive Explanations

We develop a Reinforcement Learning (RL) framework for improving an exis...
research
07/23/2018

Contrastive Explanations for Reinforcement Learning in terms of Expected Consequences

Machine Learning models become increasingly proficient in complex tasks....
research
10/11/2020

Contrastive Explanations for Reinforcement Learning via Embedded Self Predictions

We investigate a deep reinforcement learning (RL) architecture that supp...
research
11/10/2020

What Did You Think Would Happen? Explaining Agent Behaviour Through Intended Outcomes

We present a novel form of explanation for Reinforcement Learning, based...
research
03/16/2020

Towards Transparent Robotic Planning via Contrastive Explanations

Providing explanations of chosen robotic actions can help to increase th...
research
07/17/2020

Sequential Explanations with Mental Model-Based Policies

The act of explaining across two parties is a feedback loop, where one p...
research
01/27/2022

Diagnosing AI Explanation Methods with Folk Concepts of Behavior

When explaining AI behavior to humans, how is the communicated informati...

Please sign up or login with your details

Forgot password? Click here to reset