Feature importance scores and lossless feature pruning using Banzhaf power indices

11/14/2017
by Bogdan Kulynych, et al.

Understanding the influence of features in machine learning is crucial to interpreting models and selecting the best features for classification. In this work we propose using principles from coalitional game theory to reason about the importance of features. In particular, we propose the Banzhaf power index as a measure of the influence of features on the outcome of a classifier. We show that features with a Banzhaf power index of zero can be losslessly pruned, that is, removed without reducing classifier accuracy. Computing the power indices does not require access to data samples. However, if samples are available, the indices can be estimated empirically. We compute Banzhaf power indices for a neural network classifier on real-life data, and compare the results with gradient-based feature saliency and with the coefficients of a logistic regression model with L_1 regularization.
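
As a rough illustration of the idea in the abstract, the sketch below computes exact Banzhaf power indices for a toy classifier over binary features by enumerating every input. It is not the authors' implementation: the function names, the restriction to binary features, and the toy classifier are all assumptions made for this example. It uses the standard definition in which a feature's index is the fraction of settings of the other features for which flipping that feature changes the output.

```python
from itertools import product


def banzhaf_indices(classifier, n_features):
    """Exact Banzhaf index of each binary feature.

    Enumerates all 2^n inputs, so it is only practical for small n.
    Needs only the classifier itself, not any data samples.
    """
    swings = [0] * n_features
    for x in product((0, 1), repeat=n_features):
        for i in range(n_features):
            if x[i] == 0:
                # Flip feature i from 0 to 1, leaving the others fixed.
                flipped = x[:i] + (1,) + x[i + 1:]
                if classifier(x) != classifier(flipped):
                    swings[i] += 1
    # Each feature has 2^(n-1) settings of the remaining features.
    return [s / 2 ** (n_features - 1) for s in swings]


def toy_classifier(x):
    # Hypothetical classifier: uses features 0 and 1, ignores feature 2.
    return int(x[0] == 1 and x[1] == 1)


indices = banzhaf_indices(toy_classifier, 3)
# Features with index zero never change the output, so they can be dropped.
prunable = [i for i, b in enumerate(indices) if b == 0]
print(indices, prunable)
```

In this toy case the ignored feature gets an index of zero and is the only one flagged as prunable, which matches the lossless-pruning claim. For larger feature sets, exhaustive enumeration would have to give way to sampling.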


research
03/13/2018

The power of the largest player

Decisions in a shareholder meeting or a legislative committee are often ...
research
03/04/2019

Attacking Power Indices by Manipulating Player Reliability

We investigate the manipulation of power indices in TU-cooperative games...
research
07/19/2020

Reconstructing weighted voting schemes from partial information about their power indices

A number of recent works [Goldberg 2006; O'Donnell and Servedio 2011; De...
research
11/01/2022

Pi theorem formulation of flood mapping

While physical phenomena are stated in terms of physical laws that are h...
research
06/01/2022

The statistical nature of h-index of a network node

Evaluating the importance of a network node is a crucial task in network...
research
08/05/2019

FLuID: A Meta Model to Flexibly Define Schema-level Indices for the Web of Data

Schema-level indices are vital for summarizing large collections of grap...
research
06/14/2018

Direct Automated Quantitative Measurement of Spine via Cascade Amplifier Regression Network

Automated quantitative measurement of the spine (i.e., multiple indices ...
