On The Reasons Behind Decisions

02/21/2020
by Adnan Darwiche, et al.

Recent work has shown that some common machine learning classifiers can be compiled into Boolean circuits that have the same input-output behavior. We present a theory for unveiling the reasons behind the decisions made by Boolean classifiers and study some of its theoretical and practical implications. We define notions such as sufficient, necessary and complete reasons behind decisions, in addition to classifier and decision bias. We show how these notions can be used to evaluate counterfactual statements such as "a decision will stick even if ... because ... ." We present efficient algorithms for computing these notions, which are based on new advances on tractable Boolean circuits, and illustrate them using a case study.
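
To make the notion of a sufficient reason concrete, here is a minimal brute-force sketch in Python. It assumes the usual reading of the term: a sufficient reason is a minimal set of the instance's feature values that forces the same decision no matter how the remaining features are set. The toy classifier `classify` and the helper names are hypothetical and chosen only for illustration. The enumeration is exponential in the number of features, so this is not the efficient circuit-based algorithm the abstract refers to.

```python
from itertools import combinations, product


def classify(x):
    # Hypothetical toy Boolean classifier over three features (a, b, c).
    a, b, c = x
    return (a and b) or c


def is_sufficient(classifier, instance, kept):
    """Check whether fixing the features in `kept` to their instance
    values forces the same decision for every setting of the others."""
    decision = classifier(instance)
    free = [i for i in range(len(instance)) if i not in kept]
    for values in product([False, True], repeat=len(free)):
        x = list(instance)
        for i, v in zip(free, values):
            x[i] = v
        if classifier(tuple(x)) != decision:
            return False
    return True


def sufficient_reasons(classifier, instance):
    """Enumerate all minimal sufficient feature sets, smallest first."""
    n = len(instance)
    reasons = []
    for size in range(n + 1):
        for kept in combinations(range(n), size):
            s = set(kept)
            # Sufficiency is monotone, so any superset of a known reason
            # is sufficient but not minimal; skip it.
            if any(r <= s for r in reasons):
                continue
            if is_sufficient(classifier, instance, s):
                reasons.append(s)
    return reasons


if __name__ == "__main__":
    # Feature indices: 0 = a, 1 = b, 2 = c.
    print(sufficient_reasons(classify, (True, True, True)))
    print(sufficient_reasons(classify, (True, False, False)))
```

For the instance (True, True, True) the sketch reports two sufficient reasons, feature c alone and features a and b together, which matches the counterfactual reading "the decision will stick even if a and b change, because c holds."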

research
07/03/2020

On Symbolically Encoding the Behavior of Random Forests

Recent work has shown that the input-output behavior of some machine lea...
research
05/12/2021

Sufficient reasons for classifier decisions in the presence of constraints

Recent work has unveiled a theory for reasoning about the decisions made...
research
04/28/2023

A New Class of Explanations for Classifiers with Non-Binary Features

Two types of explanations have received significant attention in the lit...
research
10/20/2022

Modelling and Explaining Legal Case-based Reasoners through Classifiers

This paper brings together two lines of research: factor-based models of...
research
03/20/2022

On the Computation of Necessary and Sufficient Explanations

The complete reason behind a decision is a Boolean formula that characte...
research
05/14/2023

A Unifying Formal Approach to Importance Values in Boolean Functions

Boolean functions and their representation through logics, circuits, mac...
research
05/09/2023

Logic for Explainable AI

A central quest in explainable AI relates to understanding the decisions...