On the Complexity of SHAP-Score-Based Explanations: Tractability via Knowledge Compilation and Non-Approximability Results

04/16/2021
βˆ™
by   Marcelo Arenas, et al.
βˆ™
0
βˆ™

In Machine Learning, the 𝖲𝖧𝖠𝖯-score is a version of the Shapley value that is used to explain the result of a learned model on a specific entity by assigning a score to every feature. While in general computing Shapley values is an intractable problem, we prove a strong positive result stating that the 𝖲𝖧𝖠𝖯-score can be computed in polynomial time over deterministic and decomposable Boolean circuits. Such circuits are studied in the field of Knowledge Compilation and generalize a wide range of Boolean circuits and binary decision diagrams classes, including binary decision trees and Ordered Binary Decision Diagrams (OBDDs). We also establish the computational limits of the SHAP-score by observing that computing it over a class of Boolean models is always polynomially as hard as the model counting problem for that class. This implies that both determinism and decomposability are essential properties for the circuits that we consider. It also implies that computing 𝖲𝖧𝖠𝖯-scores is intractable as well over the class of propositional formulas in DNF. Based on this negative result, we look for the existence of fully-polynomial randomized approximation schemes (FPRAS) for computing 𝖲𝖧𝖠𝖯-scores over such class. In contrast to the model counting problem for DNF formulas, which admits an FPRAS, we prove that no such FPRAS exists for the computation of 𝖲𝖧𝖠𝖯-scores. Surprisingly, this negative result holds even for the class of monotone formulas in DNF. These techniques can be further extended to prove another strong negative result: Under widely believed complexity assumptions, there is no polynomial-time algorithm that checks, given a monotone DNF formula Ο† and features x,y, whether the 𝖲𝖧𝖠𝖯-score of x in Ο† is smaller than the 𝖲𝖧𝖠𝖯-score of y in Ο†.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
βˆ™ 07/28/2020

The Tractability of SHAP-scores over Deterministic and Decomposable Boolean Circuits

Scores based on Shapley values are currently widely used for providing e...
research
βˆ™ 06/18/2020

On the complexity of detecting hazards

Detecting and eliminating logic hazards in Boolean circuits is a fundame...
research
βˆ™ 02/12/2021

Proof complexity of positive branching programs

We investigate the proof complexity of systems based on positive branchi...
research
βˆ™ 05/14/2023

A Unifying Formal Approach to Importance Values in Boolean Functions

Boolean functions and their representation through logics, circuits, mac...
research
βˆ™ 08/28/2023

A polynomial quantum computing algorithm for solving the dualization problem

Given two prime monotone boolean functions f:{0,1}^n β†’{0,1} and g:{0,1}^...
research
βˆ™ 09/13/2020

Understanding Boolean Function Learnability on Deep Neural Networks

Computational learning theory states that many classes of boolean formul...
research
βˆ™ 02/11/2018

Recognizing Read-Once Functions from Depth-Three Formulas

Consider the following decision problem: for a given monotone Boolean fu...

Please sign up or login with your details

Forgot password? Click here to reset