Symbolic Metamodels for Interpreting Black-boxes Using Primitive Functions

02/09/2023
by   Mahed Abroshan, et al.
0

One approach for interpreting black-box machine learning models is to find a global approximation of the model using simple interpretable functions, which is called a metamodel (a model of the model). Approximating the black-box with a metamodel can be used to 1) estimate instance-wise feature importance; 2) understand the functional form of the model; 3) analyze feature interactions. In this work, we propose a new method for finding interpretable metamodels. Our approach utilizes Kolmogorov superposition theorem, which expresses multivariate functions as a composition of univariate functions (our primitive parameterized functions). This composition can be represented in the form of a tree. Inspired by symbolic regression, we use a modified form of genetic programming to search over different tree configurations. Gradient descent (GD) is used to optimize the parameters of a given configuration. Our method is a novel memetic algorithm that uses GD not only for training numerical constants but also for the training of building blocks. Using several experiments, we show that our method outperforms recent metamodeling approaches suggested for interpreting black-boxes.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/17/2020

Learning outside the Black-Box: The pursuit of interpretable models

Machine Learning has proved its ability to produce accurate models but t...
research
10/12/2018

Explaining Black Boxes on Sequential Data using Weighted Automata

Understanding how a learned black box works is of crucial interest for t...
research
07/02/2020

Am I Building a White Box Agent or Interpreting a Black Box Agent?

The rule extraction literature contains the notion of a fidelity-accurac...
research
06/13/2022

Symbolic Regression for Space Applications: Differentiable Cartesian Genetic Programming Powered by Multi-objective Memetic Algorithms

Interpretable regression models are important for many application domai...
research
11/22/2019

Low-variance Black-box Gradient Estimates for the Plackett-Luce Distribution

Learning models with discrete latent variables using stochastic gradient...
research
01/12/2018

Combining Symbolic and Function Evaluation Expressions In Neural Programs

Neural programming involves training neural networks to learn programs f...
research
10/23/2018

Interpreting Black Box Predictions using Fisher Kernels

Research in both machine learning and psychology suggests that salient e...

Please sign up or login with your details

Forgot password? Click here to reset