Counterfactual Explanation Policies in RL

07/25/2023
by   Shripad V. Deshmukh, et al.
0

As Reinforcement Learning (RL) agents are increasingly employed in diverse decision-making problems using reward preferences, it becomes important to ensure that policies learned by these frameworks in mapping observations to a probability distribution of the possible actions are explainable. However, there is little to no work in the systematic understanding of these complex policies in a contrastive manner, i.e., what minimal changes to the policy would improve/worsen its performance to a desired level. In this work, we present COUNTERPOL, the first framework to analyze RL policies using counterfactual explanations in the form of minimal changes to the policy that lead to the desired outcome. We do so by incorporating counterfactuals in supervised learning in RL with the target outcome regulated using desired return. We establish a theoretical connection between Counterpol and widely used trust region-based policy optimization methods in RL. Extensive empirical analysis shows the efficacy of COUNTERPOL in generating explanations for (un)learning skills while keeping close to the original policy. Our results on five different RL environments with diverse state and action spaces demonstrate the utility of counterfactual explanations, paving the way for new frontiers in designing and developing counterfactual policies.

READ FULL TEXT

page 7

page 8

research
02/24/2023

GANterfactual-RL: Understanding Reinforcement Learning Agents' Strategies through Visual Counterfactual Explanations

Counterfactual explanations are a common tool to explain artificial inte...
research
02/23/2023

Diverse Policy Optimization for Structured Action Space

Enhancing the diversity of policies is beneficial for robustness, explor...
research
07/13/2022

Policy Optimization with Sparse Global Contrastive Explanations

We develop a Reinforcement Learning (RL) framework for improving an exis...
research
03/08/2023

"How to make them stay?" – Diverse Counterfactual Explanations of Employee Attrition

Employee attrition is an important and complex problem that can directly...
research
05/27/2022

Non-Markovian policies occupancy measures

A central object of study in Reinforcement Learning (RL) is the Markovia...
research
12/16/2020

Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation

Reinforcement learning (RL) algorithms usually require a substantial amo...
research
10/21/2022

Counterfactual Explanations for Reinforcement Learning

While AI algorithms have shown remarkable success in various fields, the...

Please sign up or login with your details

Forgot password? Click here to reset